Methods for Obtaining Plant Varieties

TECHNICAL FIELD 

The present invention relates to nucleotide sequences which encode polypeptides 
involved in the DNA mismatch repair systems of plants, and to the polypeptides encoded 
by those nucleotide sequences. The invention also relates to nucleotide sequences and
polypeptide sequences for use in altering the DNA mismatch repair system in plants. The 
invention also relates to a process for altering the DNA mismatch repair system of a plant 
cell, to a process for increasing genetic variations in plants and to processes for obtaining 
plants having a desired characteristic. 

BACKGROUND OF THE INVENTION

Plant breeding essentially relies on and makes use of genetic variation which occurs 
naturally within and between members of a family, a genus, a species or a subspecies. 
Another source of genetic variation is the introduction of genes from other organisms 
which may or may not be related to the host plant. 

Allelic loci or non-allelic genes which constitute or contribute to desired quantitative
(e.g. growth performance, yield, etc.) or qualitative (e.g. deposition, content and
composition of seed storage products; pathogen resistance genes; etc.) traits that are
absent, incomplete or inefficient in a species or subspecies of interest are typically
introduced by the plant breeder from other species or subspecies, or de novo. This
introduction is often done by crossing, provided that the species to be crossed are sexually
compatible. Other means of introducing genomes, individual chromosomes or genes into
plant cells or plants are well known in the art. They include cell fusion, chemically aided
transfection (Schocher et al., 1986, Biotechnology 4: 1093) and ballistic (McCabe et al.,
1988, Biotechnology 6: 923), microinjection (Neuhaus et al., 1987, TAG 75: 30),
electroporation of protoplasts (Chupeau et al., 1989, Biotechnology 7: 53) or microbial
transformation methods such as Agrobacterium mediated transformation (Horsch et al.,
1985, Science 227: 1229; Hiei et al., 1996, Biotechnology 14: 745).

However, when a foreign genome, chromosome or gene is introduced into a plant, it 
will often segregate in subsequent generations from the genome of the recipient plant or 
plant cell during mitotic and meiotic cell divisions and, in consequence, become lost from
the host plant or plant cell into which it had been introduced. Occasionally, however, the 
introduced genome, chromosome or gene physically combines entirely or in part with the 
genome, chromosome or gene of the host plant or plant cell in a process which is called 
recombination. 

Recombination involves the exchange of covalent linkages between DNA molecules
in regions of identical or similar sequence. It is referred to here as homologous
recombination if donor and recipient DNA are identical or nearly identical (at least 99%
base sequence identity), and as homeologous recombination if donor and recipient DNA
are not identical but are similar (less than 99% base sequence identity).
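
By way of illustration only, the 99% threshold used in the above definition may be sketched as follows. The function names and the example sequences are hypothetical, and the sketch assumes that the donor and recipient sequences have already been aligned to equal length without gaps; the specification does not prescribe any particular method of calculating base sequence identity.

```python
def percent_identity(donor: str, recipient: str) -> float:
    """Percentage of identical positions in two pre-aligned, equal-length sequences."""
    if not donor or len(donor) != len(recipient):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(donor.upper(), recipient.upper()))
    return 100.0 * matches / len(donor)


def classify_recombination(donor: str, recipient: str, threshold: float = 99.0) -> str:
    """Apply the definition given in the text: at least 99% identity is
    'homologous', anything lower is 'homeologous'."""
    if percent_identity(donor, recipient) >= threshold:
        return "homologous"
    return "homeologous"


# Hypothetical 200-base donor sequence and two hypothetical recipients.
DONOR = "ACGT" * 50
ONE_MISMATCH = "T" + DONOR[1:]          # 199 of 200 positions identical
FOUR_MISMATCHES = "TGCA" + DONOR[4:]    # 196 of 200 positions identical
```

In this sketch a recipient differing at one position in 200 falls on the homologous side of the threshold, whereas a recipient differing at four positions in 200 is classified as homeologous.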

The ability of two genomes, chromosomes or genes to recombine is known to 
depend largely on the evolutionary relation between them and thus on the degree of 
sequence similarity between the two DNA molecules. Whereas homologous
recombination is frequently observed during mitosis and meiosis, homeologous 
recombination is rarely or never seen. 

From a breeder's perspective, the limits within which homologous recombination
occurs, therefore, define a genetic barrier between species, varieties or lines, in contrast
to homeologous recombination which can break this barrier. Homeologous recombination
is thus of great importance for plant breeding. Accordingly there is a need for a process
for enhancing the frequency of homeologous recombination in plants. In particular, there
is a need for a process of increasing homeologous recombination to significantly shorten
the length of breeding programs by reducing the number of crosses required to obtain an
otherwise rare recombination event.

At least in Escherichia coli, homologous and homeologous recombination are known
to share a common pathway that requires among others the proteins RecA, RecB, RecC,
RecD and makes use of the SOS induced RuvA and RuvB, respectively. It has been
suggested that mating induced recombination follows the Double-Strand Break Repair
model (Szostak et al., 1983, Cell 33, 25-35), which is widely used to describe genetic
recombination in eukaryotes. Following the alignment of homologous or homeologous
DNA double helices the RecA protein mediates an exchange of a single DNA strand from
the donor helix to the aligned recipient DNA helix. The incoming strand screens the
recipient helix for sequence complementarity, seeking to form a heteroduplex by hydrogen
bonding the complementary strand. The displaced homologous or homeologous strand of
the recipient helix is guided into the donor helix where it base pairs with its counterpart
strand to form a second heteroduplex. The resulting branch point then migrates along the
aligned chromosomes thereby elongating and thus stabilising the initial heteroduplexes.
Single stranded gaps (if present) are closed by DNA synthesis. The strand cross overs
(Holliday junction) are eventually resolved enzymatically to yield the recombination
products.

Although in wild type E. coli homologous and homeologous recombination are thus
mechanistically similar if not identical, homologous recombination in conjugational
crosses E. coli x E. coli occurs five orders of magnitude more frequently than
homeologous recombination in conjugational crosses E. coli x S. typhimurium (Matic et
al., 1995, Cell 80, 507-515). The imbalance in favour of homologous recombination was
shown to be caused largely by the bacterial MisMatch Repair (MMR) system since its
inactivation increased the frequency of homeologous recombination in E. coli up to 1000
fold (Rayssiguier et al., 1989, Nature 342, 396-401).

In E. coli, the MMR system (reviewed by Modrich, 1991, Annual Rev. Genetics 25,
229-253) is composed of only three proteins known as MutS, MutL and MutH. MutS
recognizes and binds to base pair mismatches. MutL then forms a stable complex with
mismatch bound MutS. This protein complex now activates the MutH intrinsic single
stranded endonuclease which nicks the strand containing the misplaced base and thereby
prepares the template for DNA repair enzymes.

During recombination, MMR components inhibit homeologous recombination. In
vitro experiments demonstrated that MutS in complex with MutL binds to mismatches at
the recombination branch point and physically blocks RecA mediated strand exchange and
heteroduplex formation (Worth et al., 1994, PNAS 91, 3238-3241). Interestingly, the
SOS dependent RuvAB mediated branch migration is insensitive to MutS/MutL,
explaining the observed slight increase in SOS dependent homeologous recombination.
Homeologous mating even induces the SOS response, thereby taking advantage of RuvAB
induction (Matic et al., 1995, Cell 80, 507-515).

The MMR system thus appears to be a genetic guardian over genome stability in E.
coli. In this role it essentially determines the extent to which genetic isolation, that is,
speciation, occurs. The diminished sensitivity of the SOS system to MMR, however,
allows (within limits) for rapid genomic changes at times of stress, providing the means
for fast adaptation to altered environmental conditions and thus contributing to intraspecies
genetic variation and species evolution.

The important role of MMR in preserving genomic integrity has been established 
also in certain eukaryotes. In its efficiency, the human MMR, for example, may even 
counteract potential gene therapy tools such as triple-helix forming oligonucleotides
including RNA-DNA hybrid molecules (Havre et al., 1993, J. Virology 67: 7234-7331; 
Wang et al., 1995, Mol. Cell. Biol. 15: 1759-1768; Kotani et al., 1996, Mol. Gen. 
Genetics 250: 626-634; Cole-Strauss et al., 1996, Science 273: 1387-1389). Such 
oligonucleotides are designed to introduce single base changes into selected DNA target 
sequences in order to inactivate for example cancer genes or to restore their normal
function. The resulting base mismatches however are recognised by the mismatch repair 
system which then directs removal of the mismatched base, thereby reducing the 
efficiency of oligonucleotide induced site-specific mutagenesis. 

To date, two families of related genes, homologous to the bacterial MutS and MutL
genes have been identified or isolated in yeast and mammals (recent reviews by Arnheim
and Shibata, 1997, Curr. Opinion Genet. Dev. 7, 364-370; Modrich and Lahue, 1996,
Annual Rev. Biochem. 65, 101-133; Umar and Kunkel, 1996, Eur. J. Biochem. 238, 297-
307). Biochemical and genetic analysis indicated that eukaryotic MutS homologs (MSH)
and MutL homologs (MLH, PMS), respectively, fulfil similar protein functions as their
bacterial counterparts. Their relative abundance, however, could reflect different
mismatch specificity and/or specialisation for different tissues or organelles or
developmental processes such as mitotic versus meiotic recombination.

To date, six different genes homologous to MutS have been isolated in yeast
(yMSH), and their homologs have been found in mouse (mMSH) and human (hMSH),
respectively. Encoded proteins yMSH2, yMSH3 and yMSH6 appear to be the main MutS
homologs involved in MMR during mitosis and meiosis in yeast, where the
complementary proteins MSH3 and MSH6 alternatively associate with MSH2 to recognise
different mismatch substrates (Marsischky et al., 1996, Genes Dev. 10, 407-420). Similar
protein interactions have been demonstrated for the human homologs hMSH2, hMSH3
and hMSH6 (Acharya et al., 1996, PNAS 93, 13629-13634).

MutL homologs (MLH and PMS), recently reviewed by Modrich and Lahue (1996,
Annual Rev. Biochem. 65, 101-133) have so far been found in yeast (yMLH1 and
yPMS1), mouse (mPMS2) and human (hMLH1, hPMS1 and hPMS2). The hPMS2 is a
member of a family of at least 7 genes (Horii et al., 1994, Biochem. Biophys. Res.
Commun. 204, 1257-1264) and its gene product is most closely related to yPMS1. Prolla
et al. (1994, Science 265, 1091-1093) presented evidence for yPMS1 and yMLH1 to
physically associate with each other and, together, to interact with the MutS homolog
yMSH2 to form a ternary complex involved in mismatch substrate binding.

However, while medical interest in mismatch repair has prompted extensive research 
on MMR in bacteria, yeast and mammals, MMR genes have not been isolated from higher 
plants prior to the present invention and no attempts to adjust the plant MMR to plant 
breeding needs have been reported. 

SUMMARY OF THE INVENTION

According to a first embodiment of the invention, there is provided an isolated and
purified DNA molecule comprising a polynucleotide sequence encoding a polypeptide
functionally involved in the DNA mismatch repair system of a plant. In one form of this
embodiment, the invention provides an isolated and purified DNA molecule comprising a
polynucleotide sequence encoding a polypeptide which is homologous to a mismatch
repair polypeptide of a yeast or of a human. More particularly, the invention provides
polynucleotide sequences encoding polypeptides which are homologous to the mismatch
repair polypeptides MSH3 and MSH6 of Saccharomyces cerevisiae. Still more
particularly, the invention provides the coding sequences of the genes AtMSH3 and
AtMSH6 of Arabidopsis thaliana, as defined hereinbelow, and polynucleotide sequences
encoding polypeptides which are homologous to polypeptides encoded by AtMSH3 and
AtMSH6.

According to a second embodiment of the invention, there is provided an isolated
and purified polypeptide functionally involved in the DNA mismatch repair system of a
plant, for example a polypeptide which is homologous to a mismatch repair polypeptide of
a yeast or of a human such as a polypeptide encoded by the genes AtMSH3 or AtMSH6 of
Arabidopsis thaliana, as defined hereinbelow.

According to a third embodiment of the invention, there is provided an isolated and 
purified DNA molecule comprising a polynucleotide sequence selected from the group 
consisting of (i) a sequence encoding a polynucleotide which is capable of interfering with 
the expression of a plant polynucleotide sequence encoding a polypeptide which is 
homologous to a mismatch repair polypeptide of a yeast or of a human and thereby
disabling said plant polynucleotide sequence; and (ii) a sequence encoding a polypeptide 
capable of disrupting the DNA mismatch repair system of a plant. 

According to a fourth embodiment of the invention there is provided a chimeric gene
comprising a DNA sequence selected from the group consisting of (i) a sequence encoding
a polynucleotide which is capable of interfering with the expression of a plant
polynucleotide sequence encoding a polypeptide which is homologous to a mismatch
repair polypeptide of a yeast or of a human and thereby disabling said plant polynucleotide
sequence, and (ii) a sequence encoding a polypeptide capable of disrupting the DNA
mismatch repair system of a plant; together with at least one regulation element capable of
functioning in a plant cell. Examples of such regulation elements include constitutive,
inducible, tissue type specific and cell type specific promoters such as 35S, NOS, PR1a,
AoPR1 and DMC1. Typically, a chimeric gene of the fourth embodiment will also
include at least one terminator sequence, more typically exactly one terminator sequence.

In the third and fourth embodiments, said interference, by said polynucleotide 
sequence, with the expression of a plant polynucleotide sequence encoding a polypeptide
which is homologous to a mismatch repair peptide of a yeast or a human typically occurs 
by hybridisation or by co-suppression. 

According to a fifth embodiment of the invention there is provided a plasmid or 
vector comprising a chimeric gene of the fourth embodiment. A vector of the fifth 
embodiment may be, for example, a viral vector or a bacterial vector.

According to a sixth embodiment of the invention, there is provided a plant cell 
stably transformed, transfected or electroporated with a plasmid or vector of the fifth 
embodiment. 

According to a seventh embodiment of the invention, there is provided a plant
comprising a cell of the sixth embodiment.

According to an eighth embodiment of the invention, there is provided a process for
at least partially inactivating a DNA mismatch repair system of a plant cell, comprising
transforming or transfecting said plant cell with a DNA sequence of the third embodiment
or a chimeric gene of the fourth embodiment or a plasmid or vector of the fifth
embodiment, and causing said DNA sequence to express said polynucleotide or said
polypeptide.

According to a ninth embodiment of the invention, there is provided a process for
increasing genetic variation in a plant comprising obtaining a hybrid plant from a first
plant and a second plant, or cells thereof, said first and second plants being genetically
different; altering the mismatch repair system in said hybrid plant; permitting said hybrid
plant to self-fertilise and produce offspring plants; and screening said offspring plants for
plants in which homeologous recombination has occurred. For example, homeologous
recombination may be evidenced by new genetic linkage of a desired characteristic trait or
of a gene which contributes to a desired characteristic trait.

According to a tenth embodiment of the invention there is provided a process for 
obtaining a plant having a desired characteristic, comprising altering the mismatch repair 
system in a plant, cell or plurality of cells of a plant which does not have said desired
characteristic, permitting mutations to persist in said cells to produce mutated plant cells, 
deriving plants from said mutated plant cells, and screening said plants for a plant having 
said desired characteristic. 

In a preferred form of the ninth and tenth embodiments of the invention, the step of
altering the mismatch repair system comprises introducing into said hybrid plant, plant,
cell or cells a chimeric gene of the fourth embodiment and permitting the chimeric gene to
express a polynucleotide which is capable of interfering with the expression of a plant
polynucleotide sequence in a mismatch repair gene of the hybrid plant, plant, cell or cells,
or a polypeptide capable of disrupting the DNA mismatch repair system of the hybrid
plant or cells.

In other embodiments, the invention provides (a) an oligonucleotide capable of
hybridising at 45°C under standard PCR conditions to a DNA molecule of the first
embodiment; (b) an oligonucleotide capable of hybridising at 45°C under standard PCR
conditions to the DNA of SEQ ID NO:18; and (c) an oligonucleotide capable of
hybridising at 45°C under standard PCR conditions to the DNA of SEQ ID NO:30; with
the proviso that the oligonucleotide of (a), (b) and (c) is other than SEQ ID NO:1 or SEQ
ID NO:2.
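
By way of illustration only, a candidate oligonucleotide may be roughly screened against the 45°C figure using the Wallace rule of thumb, under which the melting temperature of a short oligonucleotide is estimated as 2°C for each A or T plus 4°C for each G or C. The use of this rule, the function names and the example sequences below are assumptions made for the purpose of this sketch; the specification itself defines hybridisation by reference to standard PCR conditions and does not prescribe any method of estimation.

```python
def wallace_tm(oligo: str) -> int:
    """Estimate melting temperature (degrees C) of a short oligonucleotide
    with the Wallace rule: 2 x (A + T) + 4 x (G + C)."""
    seq = oligo.upper()
    if not seq or set(seq) - set("ACGT"):
        # Degenerate (IUPAC) bases are deliberately not handled in this sketch.
        raise ValueError("expected a non-empty sequence of A, C, G and T only")
    at_count = seq.count("A") + seq.count("T")
    gc_count = seq.count("G") + seq.count("C")
    return 2 * at_count + 4 * gc_count


def may_anneal_at(oligo: str, annealing_temp_c: float = 45.0) -> bool:
    """Crude screen: the estimated Tm is not below the annealing temperature."""
    return wallace_tm(oligo) >= annealing_temp_c


# Hypothetical sequences, not taken from the sequence listing.
EXAMPLE_OLIGO = "ACGTACGTACGTACGT"   # 8 A/T and 8 G/C
AT_RICH_OLIGO = "ATATATATAT"         # 10 A/T, no G/C
```

Such an estimate can serve only as a first filter, since actual hybridisation depends on salt concentration, oligonucleotide concentration and mismatches with the target.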

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 provides a diagrammatic representation of the primer sequences used to
isolate AtMSH3.

Figure 2 is a plasmid map of clone 52, showing restriction enzyme cleavage sites in
the 5' half of the full-length cDNA for AtMSH3.

Figure 3 is a plasmid map of clone 13, showing restriction enzyme cleavage sites in
the 3' half of the full-length cDNA for AtMSH3.

Figure 4 is a sequence listing of the coding sequence of AtMSH3, together with a
deduced sequence of the encoded polypeptide.

Figure 5 is a protein alignment of yeast (Saccharomyces cerevisiae) and Arabidopsis
thaliana MSH3 protein.

Figure 6 provides a diagrammatic representation of the primer sequences used to
isolate AtMSH6.

Figure 7 is a plasmid map of clone 43, showing restriction enzyme cleavage sites in
the 5' half of the full-length cDNA for AtMSH6.

Figure 8 is a plasmid map of clone 62, showing restriction enzyme cleavage sites in
the 3' half of the full-length cDNA for AtMSH6.

Figure 9 is a sequence listing of the coding sequence of AtMSH6, together with a
deduced sequence of the encoded polypeptide.

Figure 10 is a protein alignment of yeast (Saccharomyces cerevisiae) and
Arabidopsis thaliana MSH6 protein.

Figure 11 is a genomic sequence listing of AtMSH6.

Figure 12 is a plasmid map of plasmid pPF13.

Figure 13 is a plasmid map of plasmid pPF14.

Figure 14 is a plasmid map of plasmid pCW186.

Figure 15 is a plasmid map of plasmid pCW187.

Figure 16 is a plasmid map of plasmid pPF66.

Figure 17 is a plasmid map of plasmid pPF57.

Figure 18 is a diagrammatic representation of an antisense gene construction for use
in homeologous meiotic recombination.

Figure 19 is a plasmid map of plasmid p3243.

DETAILED DESCRIPTION OF THE INVENTION 

The present invention is based on the inventors' discovery that there exist in higher
plants genes which are homologous to MMR genes in E. coli, and to MMR genes in
yeasts and humans.

Thus, the inventors have identified genes, herein designated AtMSH3 and AtMSH6,
of the plant Arabidopsis thaliana which encode the proteins AtMSH3 and AtMSH6.
These plant proteins are homologous to yMSH3 and yMSH6, respectively. The present
inventors have isolated cDNAs encoding the proteins AtMSH3 and AtMSH6 and have
isolated the complete gene encoding AtMSH6. Given the teaching herein, other genes
(for example AtMSH2, and genes of other plants) may be obtained which are involved in
DNA mismatch repair in plants, including other genes which encode polypeptides
homologous to MMR proteins of yeasts or humans, such as genes which encode
polypeptides homologous to yeast MSH2, MLH1 or PMS2, or to human MLH1, PMS1 or
PMS2. For example, given the teaching herein, genes of members of the Brassicaceae
family or of other unrelated families, for example the Poaceae, the Solanaceae, the
Asteraceae, the Malvaceae, the Fabaceae, the Linaceae, the Cannabinaceae, the Daucaceae
and the Cucurbitaceae family, and which encode polypeptides homologous to MMR
proteins of yeasts or humans may be obtained.
Examples of plants whose genes encoding polypeptides homologous to MMR
proteins of yeasts or humans may be obtained given the teaching herein include maize,
wheat, oats, barley, rice, tomato, potato, tobacco, capsicum, sunflower, lettuce,
artichoke, safflower, cotton, okra, beans of many kinds including soybean, peas, melon,
squash, cucumber, oilseed rape, broccoli, cauliflower, cabbage, flax, hemp, hops and
carrot.

Within the meaning of the present invention, a first polypeptide is defined as
homologous to a second polypeptide if the amino acid sequence of the first polypeptide
exhibits a similarity of at least 50% on the polypeptide level to the amino acid sequence of
the second polypeptide.
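
By way of illustration only, the 50% criterion may be sketched as follows. The specification does not state how similarity is to be scored; the grouping of amino acids treated as similar, the function names and the example peptides are therefore assumptions made for this sketch, which further assumes that the two sequences have already been aligned to equal length without gaps.

```python
# Hypothetical grouping of residues regarded as "similar" (an assumption of
# this sketch; other scoring schemes would give different percentages).
SIMILAR_GROUPS = [
    set("AVLIM"),  # aliphatic / hydrophobic
    set("FYW"),    # aromatic
    set("KRH"),    # basic
    set("DE"),     # acidic
    set("STNQ"),   # polar uncharged
]


def residues_similar(a: str, b: str) -> bool:
    """Identical residues, or residues sharing one of the groups above."""
    return a == b or any(a in group and b in group for group in SIMILAR_GROUPS)


def similarity_percent(first: str, second: str) -> float:
    """Percentage of aligned positions holding identical or similar residues."""
    if not first or len(first) != len(second):
        raise ValueError("sequences must be non-empty and of equal length")
    pairs = zip(first.upper(), second.upper())
    similar = sum(residues_similar(a, b) for a, b in pairs)
    return 100.0 * similar / len(first)


def is_homologous(first: str, second: str, threshold: float = 50.0) -> bool:
    """Apply the definition in the text: similarity of at least 50%."""
    return similarity_percent(first, second) >= threshold


# Hypothetical hexapeptides used only to exercise the functions.
PEPTIDE_A = "MKVLDE"
PEPTIDE_B = "MRILEG"   # five of six positions identical or similar to PEPTIDE_A
PEPTIDE_C = "GPCWGP"   # no position identical or similar to PEPTIDE_A
```

Under these assumptions PEPTIDE_B would count as homologous to PEPTIDE_A and PEPTIDE_C would not.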

A procedure which may be followed to obtain genes AtMSH3 and AtMSH6 is
described in Example 1. Essentially the same technique may be applied to obtain other
mismatch repair genes of Arabidopsis thaliana, and essentially the same technique as
exemplified herein may be applied to cDNA obtained by reverse transcription of RNA
from other plants. Alternatively, given the sequence information disclosed herein, other
degenerate oligonucleotide primers, especially oligonucleotides of the invention which are
capable of hybridising at 45°C under standard PCR conditions (such as the conditions
described in Example 1 using primers UPMU and DOMU) to AtMSH3 and/or AtMSH6
may be designed and obtained for use in isolating sequences of plant mismatch repair
genes which are homologous to AtMSH3 or AtMSH6, from other plants. Similarly,
oligonucleotides of the invention which are capable of hybridising at 45°C under standard
PCR conditions to plant mismatch repair genes of plants other than Arabidopsis thaliana
also fall within the scope of the present invention and may be utilised to obtain mismatch
repair genes of still other plants. Typically, such oligonucleotides are capable of
hybridising at 45°C under standard PCR conditions to a DNA molecule which encodes a
polypeptide which is homologous to a mismatch repair polypeptide of a yeast or a human.
The temperature at which oligonucleotides of the invention hybridise to AtMSH3 and/or
AtMSH6, or to plant mismatch repair genes of plants other than Arabidopsis thaliana, or
to DNA molecules which encode polypeptides which are homologous to a mismatch repair
polypeptide of a yeast or a human may be higher than 45°C, for example at least 50°C, or
at least 55°C, or at least 60°C or as high as 65°C.

The successful gene isolation disclosed herein demonstrates for the first time the
existence of MMR in higher plants and indicates the presence of other plant MMR genes.
For example, genes encoding the plant homologs of MSH1, MSH2, MSH4, MSH5,
PMS1, PMS2 and MLH1 may be identified given the teaching herein. Such genes, as
well as those specifically described herein, separately or in combination, are useful in
manipulating the plant MMR for plant breeding purposes. Thus, for example, the plant
MMR may be altered by including in a plant cell a polynucleotide sequence as defined
herein above with reference to the third embodiment of the invention, and causing the
polynucleotide sequence to express either a polynucleotide which disables a plant MMR
gene, or a polypeptide which disrupts the plant's MMR system.

The DNA molecule of the third embodiment of the invention includes a 
polynucleotide sequence (herein referred to as a MMR altering gene) which may for 
example encode sense, antisense or ribozyme molecules characterised by sufficient base 
sequence similarity or complementarity to the gene to be altered to permit the antisense or 
ribozyme molecule to hybridise with the plant MMR gene in vivo or to permit the sense
molecule to participate in co-suppression. Alternatively, the MMR altering gene may 
encode a protein or proteins which interfere with the activity of a plant MMR protein and 
thus disrupt the plant's MMR system. For example, such encoded proteins may be 
antibodies or other proteins capable of interfering with MMR protein function, such as by 
complexing with a protein functionally involved in plant MMR thereby disrupting the
MMR of the plant. An example of such a protein is the MSH3 protein of Arabidopsis 
thaliana described herein or a protein of another plant which is homologous to the MSH3 
protein of A. thaliana. For instance, overexpression of MSH3 in a plant cell causes 
MSH2 present in the cell to be substantially completely complexed, disrupting the 
mismatch repair mechanism or mechanisms in the cell which are functionally dependent
on the presence of a complex of MSH2 with MSH6. Similarly, mismatch repair 
mechanisms which depend on the presence of a complex of MSH2 and MSH3 may be 
disrupted by the overexpression of MSH6. 

A chimeric gene of the fourth embodiment, incorporating a MMR altering gene, 
may be prepared by methods which are known in the art. Similarly, the MMR altering
gene may be introduced into a plant cell, regenerating tissue or whole plant by techniques
known in the art as being suitable for plant transformation, or by crossing. Known
transformation techniques include Agrobacterium tumefaciens or A. rhizogenes mediated 
gene transfer, ballistic and chemical methods, and electroporation of protoplasts. 

The MMR altering gene or genes are typically expressed from suitable promoters.
Suitable promoters may direct constitutive expression, such as the 35S or the NOS
promoter. Usually, however, the promoter will direct either inducible or tissue specific
(e.g. callus; embryonic tissue; etc.), cell type specific (e.g. protoplasts; meiocytes; etc.)
or developmental (e.g. embryo) expression of the altering gene or genes, in order for the
MMR system to function in tissue types or cell types, or at developmental stages of the
plant, in which it is not desirable for the MMR system to be altered. Using such
promoters, therefore, the activity of a MMR altering gene may be limited to a specific
stage during plant development or it may be altered by controlling conditions external to
the plant, and the deleterious effects of a permanently disabled or altered DNA mismatch
repair system in a plant may be avoided. Examples of suitable promoters which are not
constitutive are known in the art and include inducible promoters such as PR1a (reviewed
by Gatz, 1997, Annual Rev. Plant Phys. Plant Mol. Biol. 48: 89), tissue specific
promoters such as AoPR1 (Sabahattin et al., 1993, Biotechnology 11: 218), and cell-type
specific promoters such as DMC1.

A chimeric gene in accordance with the invention may further be physically linked 
to one or more selection markers such as genes which confer phenotypic traits such as 
herbicide resistance, antibiotic resistance or disease resistance, or which confer some 
other recognisable trait such as male sterility, male fertility, grain size, colour, growth 
rate, flowering time, ripening time, etc.

The process of the tenth embodiment of the invention provides, for example, a
process for generating intraspecies genetic variation by altering the mismatch repair
system in a plant cell, in regenerating plant tissue or in a whole plant. The plant cell,
regenerating tissue or whole plant includes and expresses one or more MMR altering
genes which are capable of altering mismatch repair in the plant cell, regenerating tissue
or whole plant. Alteration of MMR may be achieved, for example, by inactivating the
genes encoding plant MSH3 and/or plant MSH6. It is preferred to inactivate the plant
MSH3 and MSH6 encoding genes at the same time and in the same plant cell,
regenerating tissue or whole plant. Typically in this preferred form of the invention
inactivation of either plant MSH3 or MSH6 alone is insufficient to substantially alter the
plant's mismatch repair system and only when both MSH3 and MSH6 are inactivated
simultaneously is the plant's mismatch repair system sufficiently altered to prevent the
MMR system from recognising base pair mismatches, base insertions or deletions as a
result of DNA replication errors, DNA damage, or oligonucleotide induced site-specific
mutagenesis. However, in some applications of the invention, inactivation of only one
gene may also be used to cause genomic instability or increase the efficiency of
site-specific mutagenesis.

If desired, the MMR altering gene or genes may later be rendered non-functional or
ineffective, or may be removed from the genome of the plant cell, regenerating tissue or
whole plant in order to restore mismatch repair in the plant cell, regenerating tissue or
whole plant. The MMR altering gene or genes may be inactivated by means of known
gene inactivation tools, such as ribozymes, or may be removed from the genome using
gene elimination systems known in the art, such as CRE/LOX. It is preferred to render
two genes, whose gene products combine to incapacitate MMR, ineffective by separating
the altering genes through segregation. Therefore, in a preferred embodiment of the
invention a first plant cell or plant is generated in which only plant MSH3 is incapacitated,
and a second plant cell or plant is generated in which only plant MSH6 is incapacitated.
The combination of both genomes, for example by crossing, then produces significant
MMR deficiency in those cells or plants which have inherited both altering genes. If the
altering genes are expressed from unlinked loci, gene segregation restores MMR activity
in the progeny of the cells or plants.
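
By way of illustration only, the expected effect of segregation may be sketched with simple Mendelian arithmetic. The sketch assumes two unlinked, dominantly acting altering genes, assumes that MMR is significantly impaired only in progeny carrying at least one copy of each, and uses hypothetical function names; none of these details is prescribed by the specification.

```python
from fractions import Fraction
from itertools import product


def gametes(genotype):
    """Gamete distribution for unlinked loci.

    genotype: tuple giving the number of copies (0, 1 or 2) of the altering
    gene at each locus. Returns {gamete: probability}, where a gamete is a
    tuple of 0/1 flags, one per locus.
    """
    per_locus = []
    for copies in genotype:
        carries = Fraction(copies, 2)  # chance that a gamete carries the gene
        per_locus.append([(1, carries), (0, 1 - carries)])
    distribution = {}
    for combo in product(*per_locus):
        flags = tuple(flag for flag, _ in combo)
        prob = Fraction(1)
        for _, p in combo:
            prob *= p
        if prob:  # skip impossible gametes
            distribution[flags] = distribution.get(flags, Fraction(0)) + prob
    return distribution


def fraction_with_both(parent_a, parent_b):
    """Fraction of progeny carrying at least one copy of every altering gene."""
    total = Fraction(0)
    for (ga, pa), (gb, pb) in product(gametes(parent_a).items(),
                                      gametes(parent_b).items()):
        if all(x + y > 0 for x, y in zip(ga, gb)):
            total += pa * pb
    return total


# Locus order: (MSH3 altering gene, MSH6 altering gene).
CROSS_OF_HEMIZYGOTES = fraction_with_both((1, 0), (0, 1))
SELFED_DOUBLE_HEMIZYGOTE = fraction_with_both((1, 1), (1, 1))
```

Under these assumptions, crossing a plant hemizygous for the MSH3 altering gene with a plant hemizygous for the MSH6 altering gene gives one quarter of progeny carrying both genes, and selfing such a doubly hemizygous plant gives nine sixteenths carrying both, so that the remaining seven sixteenths would be expected to have regained MMR activity.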

In a process of the ninth embodiment of this invention, homeologous recombination
is enhanced between different genomes, chromosomes or genes in plant cells or plants by
altering MMR in said plant cells or plants. Such genomes, chromosomes or genes are
characterised in that they originate from different plant families, genera, species,
subspecies, plant varieties or lines. Hybrid plant cells or hybrid plants may be produced
by crossing, by cell fusion or by other techniques known in the art. These plant cells or
plants are further characterised by expressing one or more genes that are capable of
altering mismatch repair in the plant cell or plants.

In the process of the ninth embodiment, the homeologous recombination is typically
for the purpose of introducing a desired characteristic in the hybrid plant. In this typical
application of the process of the ninth embodiment, and in the process of the tenth
embodiment, the desired characteristic may be any characteristic which is of value to the
plant breeder. Examples of such characteristics are well known in the art and include
altered composition or quality of leaf or seed derived storage products (e.g. oil, starch,
protein), altered composition or quality of cell walls (e.g. decrease in lignin content),
altered growth rate, prolonged flowering, increased plant yield or grain yield, altered
plant morphology, resistance to pathogens, tolerance to or improved performance under
environmental stresses of various kinds, etc.

In a preferred form of the tenth embodiment, an MMR altering gene is co-introduced
along with the homeologous genome, chromosome or gene of another plant
cell or plant into an MMR proficient plant cell or MMR proficient plant to produce a
hybrid plant cell or hybrid plant in which homeologous recombination can occur.
Suitably, the MMR proficient plant cell or MMR proficient plant may also include an
MMR altering gene. For example a gene capable of inactivating plant MSH3 may be
co-introduced along with the homeologous genome, chromosome or gene of another plant
cell or plant into an MMR proficient plant cell or MMR proficient plant in which MSH6
is inactivated. A resultant hybrid plant in which homeologous recombination occurs will
include both the MSH3 and MSH6 altering genes and its MMR system will therefore be
inactivated.

In this form of the invention, if hybrid plants are to be produced by crossing, the
MMR altering gene preferably originates from the male parent, thus ensuring that the
MMR altering gene is always introduced and is not present in the recipient cell. That is,
the MMR of the recipient cell, prior to introduction of the MMR altering gene, is
typically proficient. Alternatively, if an MMR altering gene is present in a recipient cell it
may be ineffective or inefficient on its own, or it may be linked to an inducible or tissue
specific or cell type specific promoter which only renders the MMR altering gene active
under limited conditions.

Thus, in a preferred form of the process of the ninth embodiment, the MMR system
of the hybrid plant is initially unaltered. In this form of the process, the step of altering
the mismatch repair system may comprise introducing into the hybrid plant, or cells
thereof, an MMR altering gene, such as by Agrobacterium tumefaciens or A. rhizogenes
mediated gene transfer, ballistic and chemical methods, and electroporation of protoplasts.

The MMR altering gene or genes are typically expressed from suitable promoters, as
described above. Preferably, the promoter is transcriptionally active in mitotically and
meiotically active tissue and/or cells to ensure MMR alteration after chromosome pairing
at mitosis and meiosis, respectively. The preferred timing for MMR alteration is at
meiosis, because recombinant genomes, chromosomes or genes are directly transmitted to
the progeny. A suitable meiocyte specific promoter is for example the DMC1 promoter
from Arabidopsis thaliana ssp. Ler (Klimyuk and Jones, 1997, Plant J. 11, 1-14).
However, mitotic homeologous recombination is also a desirable outcome as somatic
recombination events can be transmitted to offspring due to the totipotency of plant cells
and the lack of predetermined germ cells in plants.

If desired, the MMR altering gene or genes may later be rendered non-functional or
ineffective, or may be removed from the hybrid plant or hybrid plant cells, in order to
restore mismatch repair in the hybrid plant or hybrid plant cells. The MMR altering gene
or genes may be inactivated by means of known gene inactivation tools as described
herein above.

EXAMPLES 

Example 1. Cloning of the AtMSH3 and AtMSH6 coding sequences

Isolation of partial AtMSH3 and AtMSH6 consensus sequences

Degenerate oligonucleotides UPMU (SEQ ID NO:1) and DOMU (SEQ ID NO:2)

UPMU CTGGATCCACIGGICCIAA(C/T)ATG
DOMU CTGGATCC(A/G)TA(A/G)TGIGTI(A/G)C(A/G)AA
were used to isolate AtMSH3 and AtMSH6 sequences by PCR amplification.

Primers UPMU and DOMU correspond to conserved amino acid sequences of the
proteins MutS (E. coli and S. typhimurium), HexA (S. pneumoniae), Rep1 (mouse) and
Duc1 (human). The conserved regions to which they are targeted are TGPNM for UPMU
(amino acid positions 852-856 for AtMSH6 and 816-820 for AtMSH3) and FATHY or FVTHY
for DOMU (amino acid positions 964-968 for AtMSH6 and 928-932 for AtMSH3,
respectively). These primers have been used to isolate MSH2 and MSH1 from yeast
(Reenan and Kolodner, Genetics 132: 963-973 (1992)) and MSH2 from Xenopus and mouse
(Varlet et al., Nuc. Acids Res. 22: 5723-5728 (1994)).
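
The complexity of the two degenerate primers can be estimated with a short calculation. The sketch below is an illustration only: it assumes that each bracketed group such as (C/T) denotes a mixture of the listed bases at one position and that I denotes inosine, which is counted as a single species; the function names are hypothetical.

```python
import re

def degeneracy(primer):
    """Number of distinct oligonucleotide species in a primer written with (X/Y) groups."""
    total = 1
    for group in re.findall(r"\(([^)]*)\)", primer):
        total *= len(group.split("/"))
    return total

def primer_length(primer):
    """Length in nucleotides, counting each (X/Y) group as a single position."""
    return len(re.sub(r"\([^)]*\)", "N", primer))

UPMU = "CTGGATCCACIGGICCIAA(C/T)ATG"
DOMU = "CTGGATCC(A/G)TA(A/G)TGIGTI(A/G)C(A/G)AA"

upmu_species = degeneracy(UPMU)  # one two-fold position
domu_species = degeneracy(DOMU)  # four two-fold positions
```

Both primers come out at 23 nucleotides, consistent with an 8 nucleotide 5' tail followed by 15 nucleotides covering the five conserved amino acids.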

Template single strand cDNA was produced by reverse transcription of 2 ng total
RNA from a cell suspension culture of Arabidopsis thaliana ecotype Columbia (Axelos et
al. 1989, Mol. Gen. Genetics 219: 106-112). The PCR reaction was performed under the
following conditions in a final volume of 100µl: 0.2mM dNTP, 1µM each primer, 1X PCR
buffer, 1 u Taq DNA polymerase (Appligene) in the presence of template cDNA. PCR
parameters were 5 minutes at 94°C, followed by 30 cycles of 40 seconds at 95°C, 90
seconds at 45°C, 1 minute at 72°C. The amplification products were cloned into pGEM-T
vector (Promega) and sequenced. Two different clones were isolated: S5 (350bp) was
homologous to MSH3, S8 (327bp) was homologous to MSH6. Complete cDNA sequences
were then isolated according to the Marathon cDNA amplification kit procedure (Clontech).

In summary, this procedure involves producing double stranded cDNA by reverse
transcription of 2µg polyA+ RNA from the cell suspension culture of Arabidopsis.
Adaptors are ligated on each side of the cDNA. The ligated cDNA is used as a template for
5' and 3' RACE PCR reactions in the presence of primers that are specific for the adaptor on
one side (AP1 and AP2), and specific for the targeted gene on the other side. A 5' and a 3'
fragment that overlap are thus produced for each gene. The complete gene coding sequence
can be reconstituted taking advantage of a unique restriction site, if available, in the
overlapping region. Specific details of this procedure as it was used to isolate AtMSH3 and
AtMSH6 coding regions, are as follows.
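
The reconstitution step can be pictured with a toy example. The sketch below is an illustration under stated assumptions: the sequences are invented and are not taken from the sequence listing, the site is modelled as a plain substring that occurs exactly once in each fragment, and the function name is hypothetical.

```python
def join_at_unique_site(five_prime, three_prime, site):
    """Join two overlapping fragments at a recognition site that is unique in each.

    Keeping the 5' fragment up to the site and the 3' fragment from the site
    onwards mimics cutting both clones at the shared site and ligating them.
    """
    if five_prime.count(site) != 1 or three_prime.count(site) != 1:
        raise ValueError("site must occur exactly once in each fragment")
    left = five_prime[: five_prime.index(site)]
    right = three_prime[three_prime.index(site):]
    return left + right

# Toy sequences (hypothetical)
full = "ATGGCTTCAGGATCCTTGACCGTATAA"
five = full[:20]   # 5' fragment, ends downstream of the shared site
three = full[8:]   # 3' fragment, starts upstream of the shared site
rebuilt = join_at_unique_site(five, three, "GGATCC")
```

If the site occurs more than once in either fragment the function refuses to join, which mirrors the requirement in the text that the restriction site be unique.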
Isolation of AtMSH3 complete coding sequence

From the sequence of clone S5, primer 636 (SEQ ID NO:3) was designed:
636 TGGTAGTGCGTCTTGCAAGGTGAT.
Primer AP1 (SEQ ID NO:4) is complementary to a portion of an adaptor sequence
which had been ligated to the 5' and 3' ends of Arabidopsis cDNA:

AP1 CGATGCTAATACGACTCAGTATAGGGG.
PCR performed on the ligated cDNA with primers 636 and AP1 for the 5' RACE PCR
was followed by a second round of amplification with the nested primers AP2 (SEQ ID
NO:5) and S525 (SEQ ID NO:6)

AP2 ACTGAGTATAGGGCTCGAGGGGC
S525 AGGTTGTGATTATGTGTGAGGGTTTAGTTA
(the latter was also designed to correspond to a part of the sequence of clone S5) and
produced a 2720bp DNA fragment. Figure 1 provides a diagrammatic representation of the
primer sequences used to isolate AtMSH3. Another primer (S51, SEQ ID NO:7)
S51 GGATCGGGTACTGGGTTTTGAGTGTGAGG
was designed closer to the 5' border and permitted the determination of 99bp upstream to
the ATG initiation codon. For the 3' RACE PCR, a first PCR reaction was performed with
primers AP1 and 635 (SEQ ID NO:8)

635 GCACGTGCTTGATGGTGTTTTCAC 
followed by a second round of amplification, using the nested primers AP2 and S523 (SEQ
ID NO:9)

S523 TCAGACAGTATCCAGCATGGCAGAAGTA
which produced a DNA fragment of 890bp. Both DNA fragments were subcloned into
pGEM-T and sequenced. Since PCR amplification using the Expand Long Template PCR
System (Boehringer-Mannheim) produced errors in the sequence, new oligonucleotides
were designed to isolate those sequences again by PCR, but with the high fidelity DNA
polymerase Pfu. PCR with primers 1S5 (SEQ ID NO:10) and S53 (SEQ ID NO:11)
1S5 ATCCCGGGATGGGCAAGCA.\AAGCAGCAGACGA
S53 GACAAAGAGCGAAATGAGGCCCCTTGG
amplified the 1244bp fragment clone 52 (SEQ ID NO:12, cloned into pUC18/SmaI). PCR
with primers S52 (SEQ ID NO:13) and 2S5 (SEQ ID NO:14)

2S5 ATCCCGGGTCAAAATGAACA.AGTTGGTTTTAGTC 
S52 GCCACATCTGACTGTTCAAGCCCTCGC 
amplified the 2104bp clone 13 (SEQ ID NO:15, cloned into pUC18/SmaI). The complete
coding sequence of the AtMSH3 gene was reconstructed in pUC18 by ligating the 5' half
of AtMSH3 (clone 52) to the 3' half of AtMSH3 (clone 13) after digesting with BamHI
which has a unique cleavage site in the overlapping region of both clones. This
manipulation yielded plasmid pPF26. The SmaI fragment from pPF26 contains the
complete AtMSH3 coding sequence. The remaining primers referred to in Figure 1 are as
follows:

S51 GGATCGGGTACTGGGTTTTGAGTGTGAGG (SEQ ID NO:16)

S525 AGGTTCTGATTATGTGTGACGCTTTACTTA (SEQ ID NO:17)

Figures 2 and 3 provide plasmid maps of clones 52 and 13 respectively, showing
restriction enzyme cleavage sites. The complete AtMSH3 coding sequence (SEQ ID NO:18)
is 3246bp long and is shown in Figure 4 together with the deduced sequence (SEQ ID
NO:19) of the encoded polypeptide. AtMSH3 is clearly homologous to the yeast and mouse
MSH3 genes. A sequence alignment of polypeptides encoded by AtMSH3 and that encoded
by Saccharomyces cerevisiae MSH3 is set out in Figure 5.

Isolation of the AtMSH6 complete coding sequence and genomic sequences

The same procedure allowed isolation of the AtMSH6 cDNA. Figure 6 provides a
diagrammatic representation of the primer sequences used to isolate AtMSH6. For the 5'
RACE PCR, primers 638 (SEQ ID NO:20) and AP1 (SEQ ID NO:4)

638 TCTCTACCAGGTGACGAAAAACCG
allowed the amplification of a 2889bp DNA fragment. Primer S81 (SEQ ID NO:21)



S81 CGTCGCCTTTAGCATCCCCTTCCITCAC
helped define the 142bp upstream to the ATG initiation codon. On the 3' side, RACE PCR
was initially performed with primers S823 (SEQ ID NO:22) and AP1 (SEQ ID NO:4),

S823 GCTTGGCGCATCTAATAGAATCATGACAGG
and then with the nested primers 637 (SEQ ID NO:23) and AP2 (SEQ ID NO:5),

637 GACAGCGTCAGTTCTTCAGAATGC
to produce a 774bp DNA fragment. As for AtMSH3, those fragments were cloned and
sequenced. Re-isolation of the DNA sequence using the high fidelity Pfu polymerase and
newly designed primers 1S8 (SEQ ID NO:24) and S83 (SEQ ID NO:25) (for the 5' side) led
to a 2182bp DNA fragment identified as clone 43 (SEQ ID NO:26, cloned in
pUC18/SmaI), and a 1379bp clone identified as clone 62 (SEQ ID NO:27, also cloned in
pUC18/SmaI).

1S8 ATCCCGGGATGCAGCGCCAGAGATCGATTTTGT

2S8 ATCCCGGGTIATTTGGGAACACAGTAAGAGGATT (SEQ ID NO:28)

S82 GCGTTCGATCATCAGCCTCTGTGTTGC (SEQ ID NO:29)

S83 CGCTATCT.ATGGCTGCTTCGAATTGAG

Figures 7 and 8 provide plasmid maps of clones 43 and 62 respectively, showing restriction
enzyme cleavage sites. Clones 43 and 62 were digested by the XmnI restriction enzyme for
which a unique site is present in their overlapping region and then ligated. The complete
AtMSH6 coding sequence (SEQ ID NO:30) is 3330bp long and is shown in Figure 9
together with the deduced sequence (SEQ ID NO:31) of the encoded polypeptide. AtMSH6
is clearly homologous to the yeast and mouse MSH6 genes. A sequence alignment of
polypeptides encoded by AtMSH6 and that encoded by Saccharomyces cerevisiae MSH6 is
set out in Figure 10.
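
As a quick arithmetic check on the coding sequence lengths quoted above, a coding sequence of n base pairs spans n/3 codons. The sketch below assumes, for illustration only, that the quoted lengths (3246bp for AtMSH3 and 3330bp for AtMSH6) include the stop codon; the function name is hypothetical and the resulting residue counts are derived by calculation, not taken from the sequence listing.

```python
def residues_from_cds(cds_bp, includes_stop=True):
    """Number of amino acid residues encoded by a coding sequence of cds_bp base pairs."""
    if cds_bp <= 0 or cds_bp % 3 != 0:
        raise ValueError("coding sequence length must be a positive multiple of 3")
    codons = cds_bp // 3
    # A stop codon counted in the length does not encode a residue.
    return codons - 1 if includes_stop else codons

atmsh3_residues = residues_from_cds(3246)
atmsh6_residues = residues_from_cds(3330)
```

Both lengths are exact multiples of three, as expected for complete open reading frames, and the derived polypeptide lengths are large enough to contain the conserved primer target positions given earlier in this Example.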

An AtMSH6 genomic sequence was also isolated from a genomic DNA library
constituted after partial Sau3AI digestion of DNA from the Arabidopsis cell suspension.
8062bp were sequenced that covered the AtMSH6 gene and show colinearity with the
cDNA. 16 introns are found scattered along the gene. The complete genomic sequence
(SEQ ID NO:98) is shown in Figure 11.

Example 2. A measure of somatic variation in MMR deficient plants

Constructs

Constructs with antisense AtMSH3 or antisense AtMSH6 or both AtMSH3/AtMSH6
under the control of a single 35S promoter have been inserted into the binary vector
pPZP121 (Hajdukiewicz et al., Plant Mol. Biol. 23, 793-799) between the right and left
borders of the T-DNA. The pPZP121 plasmid confers chloramphenicol resistance to
Escherichia coli or Agrobacterium tumefaciens bacteria. The aacC1 gene is carried by the
T-DNA and allows selection of transformed plant cells on gentamycin (Hajdukiewicz et al.,
Plant Mol. Biol. 25, 989-994). For the purpose of expressing antisense constructs, a 35S
promoter/terminator cassette with a polylinker was introduced into pPZP121. The 3' ends of
the considered genes have been chosen since this region seems more efficient for antisense
inhibition. For AtMSH3 this corresponds to clone 13 (2104bp), for AtMSH6 this
corresponds to clone 62 (1379bp). Clone 13 comprises 2104bp of the 3' region that were cut
off the pUC18 vector by SalI/SstI restriction, blunted with T4 DNA polymerase and ligated
into the T4 DNA polymerase blunted BamHI site of pPZP121/35S, creating clone pPF13.
The same procedure was followed for the 3' region of AtMSH6 clone 62 (1379bp) thus
creating plasmid pPF14. For the double constructs, the 3' region (from clone 62) of
AtMSH6 was introduced ahead of the AtMSH3 region into pPF13 creating pCW186 and
reciprocally, the 3' region of AtMSH3 (from clone 13) was introduced ahead of AtMSH6 into
pPF14, creating pCW187.

These constructs were introduced into the Arabidopsis cells (as described below) of 
wildtype Columbia and of the Columbia tester line. 

An alternative strategy to antisense inhibition of AtMSH6 comes from experiments of
Marra et al. (1998, Proc. Natl. Acad. Sci. USA 95, 8568-8573) who show that
overexpression of functional MSH3 results in depletion of MSH6 protein in human cells.
This depletion may generate a mismatch repair mutant phenotype.

For the purpose of overexpressing functional AtMSH3 protein in plant cells, the
complete MSH3 coding region was excised from pPF26 (Example 1) by digestion with
SmaI, and was inserted into the SmaI site of pCW164. The resulting construct was named
pPF66. It contains a complete AtMSH3 gene under the control of the 35S promoter inside
the left (LB) and right (RB) border of the T-DNA. This T-DNA also contains the hpt2 gene
for gentamycin selection. Plasmid pPF66 was introduced into Arabidopsis cells as
described below. One cell clone was selected which clearly overexpressed the AtMSH3
gene as shown by Northern analysis. Figures 12-16 provide plasmid maps of plasmids
pPF13, pPF14, pCW186, pCW187 and pPF66, respectively.

Construction of tester construct 

For the purpose of Forward Mutagenesis Assays, a tester construct was built
containing the coding regions for nptII, codA, uidA. All three genes are driven by the 35S
promoter and are terminated by the 35S terminator. This construct was obtained by
introducing an EcoRI fragment encoding the codA cassette (2.5kb) and a HindIII fragment
encoding the uidA (GUS) cassette (2.4kb) into the pPZP111 vector (Hajdukiewicz et
al., 1994, Plant Mol Biol 23: 793-799) which already contained the nptII expression cassette.
This new plasmid was named pPF57. NptII is used to select for transformed plant cells.
GUS is used to analyse the degree of gene silencing in the construct (i.e. to identify cell
lines in which the transgenes are expressed), and codA is used as a marker for forward
mutagenesis (described below).
The plasmid map of pPF57 is provided in Figure 17.

Plant cell transformation

The constructs are introduced into Agrobacterium by electroporation. Plant cells are
then transformed by co-cultivation. A suspension culture of Arabidopsis thaliana cells that
has been established by Axelos et al. (1992, Plant Physiol. Biochem. 30, 1-6) may be used.
One day old freshly subcultured cells are diluted five times into AT medium (Gamborg B5
medium, 30g/l sucrose, 200µg/l NAA). 10µl of saturated Agrobacterium containing the
transforming T-DNA constructs are added to 10ml diluted cells in a 100ml erlenmeyer. The
co-cultivation is agitated slowly (80rpm) for 2 days. The cells are then washed 3 times into
AT medium and finally resuspended in the same initial volume (10ml). The culture is
agitated for 3 days to allow expression before plating on selection plates (AT/BactoAgar
8g/l + gentamycin 50µg/ml). Transformed individual calli are isolated 3 weeks later.

Tester Strain 

The tester construct on plasmid pPF57 was introduced into Arabidopsis cells of
wildtype Columbia using the transformation protocol described above. Among 10 candidate
transformants, one cell clone was shown (by Southern analysis) to have a unique T-DNA
insertion. All three genes were shown to be functional in this cell line as indicated by
resistance to kanamycin, blue staining in the presence of X-Glu (GUS), and sensitivity to
5-fluoro-cytosine (codA).

MMR altering genes (described above) were then introduced individually into the
tester line and transformed cells are used for analysis of both Microsatellite Instability and
Forward Mutagenesis.

Microsatellite analysis 

Microsatellites have been described in Arabidopsis (Bell and Ecker, 1994, Genomics
19, 137-144). The present Example is based on a study of instability of microsatellites that
are polymorphic amongst different ecotypes. DNA is extracted from the transformed calli.
Specific primers have been defined that are used to amplify the microsatellite sequence.
One of the two primers is first labelled with P32 by T4 kinase. In case of a polymorphic
variation, new PCR products appear that do not follow the expected pattern of migration on
a polyacrylamide gel. This is a commonly observed feature for MMR deficient cells in
yeast or mammalian cells.

In particular, the present Example describes a study on microsatellites ca72 (CA18),
nga172 (GA29), and ATHGENEA (A39), chosen because they belong to the types
predominantly affected in human mismatch repair deficient tumors. The size of these
microsatellites is not conserved from one Arabidopsis ecotype to the other.

Arabidopsis cells which are transformed with an MMR altering gene (above) and
control cells not expressing the MMR altering gene are allowed to form calli. DNA is
rapidly extracted from the calli and is analysed for microsatellite instability as described in
detail by Bell and Ecker, 1994, Genomics 19, 137-144. In summary, the relevant
microsatellite is amplified by PCR using P32 labelled primers. The PCR products are
separated on a DNA sequencing gel for size determination. Size differences between
microsatellites from transformed and control cells not expressing the MMR altering gene in
question indicate microsatellite instability as a result of MMR alteration.
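
The comparison of fragment sizes can be sketched as a simple set difference. The example below is an illustration only: the fragment sizes are invented, not measured, and the function names and the `tolerance` parameter (the sizing error allowed when two bands are treated as identical) are hypothetical.

```python
def novel_fragments(control_bp, sample_bp, tolerance=0):
    """Fragment sizes (bp) present in the sample lane but not in the control lane."""
    return sorted(
        s for s in set(sample_bp)
        if all(abs(s - c) > tolerance for c in control_bp)
    )

def shows_instability(control_bp, sample_bp, tolerance=0):
    """True if the sample carries at least one fragment absent from the control."""
    return bool(novel_fragments(control_bp, sample_bp, tolerance))

# Hypothetical gel readings in bp (not measured data)
control = [248]
transformed = [248, 246, 252]
extra_bands = novel_fragments(control, transformed)
```

A sample whose bands all match the control within the chosen tolerance is scored as stable; any additional band is flagged for closer inspection on the sequencing gel.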

The sequences of primers used for PCR amplification of microsatellites ca72 and
nga172 are included in Table 1. PCR amplification of microsatellite ATHGENEA made use
of a forward primer containing the sequence
ACCATGCATAGCTTAAACTTCTTG (SEQ ID NO:32)
and of a reverse primer containing the sequence
ACATAACCACAAATAGGGGTGC (SEQ ID NO:33).

The amplification for microsatellite ca72 revealed in Columbia control cells (with
respect to the MMR altering gene) a 248 bp long PCR fragment instead of the published
length of 124 bp. DNA sequencing verified this fragment as a CA18 microsatellite.

Forward mutagenesis assay

Tester cells transformed with antisense AtMSH3 or antisense AtMSH6 or both
AtMSH3/AtMSH6 are analysed for the stability of the codA gene. The functional codA gene
confers sensitivity to 5-fluoro-cytosine (5FC), whereas a gene inactivating mutation in
codA will confer resistance to 5FC. The frequency of resistant cells is therefore a good
indicator of somatic variation as a direct result of MMR alteration. Variants resistant to 5FC
are first analysed for GUS activity. If GUS is inactive, 5FC resistance is assumed to be due
to gene silencing (all three genes are under the 35S promoter). If GUS is active, 5FC
resistance is assumed to be due to forward mutations that have inactivated codA. PCR is
then performed on the putative codA mutant genes, which are then sequenced to confirm the
presence of forward mutations in codA.
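
The decision rule used to interpret 5FC resistant variants can be written out explicitly. The sketch below is an illustration only: the function names, the class labels and the example counts are hypothetical, and the "putative" class still requires confirmation by sequencing as described above.

```python
def classify_variant(resistant_to_5fc, gus_active):
    """Interpret one cell line using the 5FC and GUS readouts described in the text."""
    if not resistant_to_5fc:
        return "codA functional"
    if not gus_active:
        # All three genes share the 35S promoter, so loss of GUS suggests silencing.
        return "presumed gene silencing"
    return "putative codA forward mutation"

def mutation_frequency(variants, cells_plated):
    """Putative forward mutants per plated cell; variants is a list of (resistant, gus_active)."""
    if cells_plated <= 0:
        raise ValueError("cells_plated must be positive")
    mutants = sum(
        classify_variant(r, g) == "putative codA forward mutation" for r, g in variants
    )
    return mutants / cells_plated

# Hypothetical readouts for three cell lines out of 1000 plated cells
example = [(True, True), (True, False), (False, True)]
frequency = mutation_frequency(example, 1000)
```

Separating silenced lines from putative mutants before computing the frequency keeps silencing events from inflating the apparent mutation rate.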

Besides codA, other marker genes may also be used for the Forward Mutagenesis
Assay such as the ALS gene (conferring sensitivity to valine or to sulfonylurea; Hervieu and
Vaucheret, 1996, Mol. Gen. Genet. 251, 220-224; Mazur et al. 1987, Plant Physiol. 55,
1110-1117).

Example 3. Homeologous meiotic recombination in Arabidopsis thaliana

A. Construction of a meiocyte specific gene expression cassette comprising the DMC1
promoter and the NOS terminator

(i) The DMC1 promoter may be used as published (Klimyuk and Jones, 1997,
Plant J. 11, 1-14). To obtain a more convenient alternative for gene cloning, a 3.3 Kb
long subfragment of the DMC1 promoter was obtained by PCR from genomic DNA of
Arabidopsis thaliana (ssp. Landsberg erecta "Ler").
The PCR was done in three rounds:

Round One: A 3.7 Kb long product was obtained using the forward primer
DMCIN-A comprising the sequence
GAAGCGATATTGTTCGTG (SEQ ID NO:34)
and the reverse primer DMCIN-B comprising the sequence
AGATTGCGAGAACATTCC (SEQ ID NO:35).
The weak amplification product was then used as template for round two and three.

Round Two: A 3.1 Kb long product comprising the promoter and the 5'
untranslated leader was obtained using forward primer DMCIN-1, which contained the
sequence
acgcgtcgacTCAGCTATGAGATTACTCGTG (SEQ ID NO:36)
and introduced a SalI cloning site at the 5' end of the promoter fragment, and reverse
primer DMCIN-2 which contained the sequence
gctctagaTTTCTCGCTCTAAGACTCTCT (SEQ ID NO:37)
and introduced an XbaI site at the 3' end of the PCR fragment.

Round Three: A 0.2 Kb long product comprising the first exon/intron of the DMC1
promoter was obtained using forward primer DMCIN-3, which contained the sequence
gctctagaGCTTCTCTTAAGTAAGTGATTGAT (SEQ ID NO:38)
and introduced an XbaI site at the 5' end of the PCR fragment, and reverse primer
DMCIN-4, containing the sequence
tcccccgggctcgagagatctccatggTTTCTTCAGCTCTATGAATCC (SEQ ID NO:39)
and introduced at the 3' end of the PCR product restriction sites for NcoI, BglII, XhoI and
SmaI.
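
The restriction sites carried by the 5' tails of these primers can be checked with a simple string search. The sketch below uses the standard recognition sequences of the enzymes named in the text and the DMCIN-1 and DMCIN-4 sequences as printed above; the helper name `sites_in` is hypothetical.

```python
# Standard recognition sequences of the enzymes named in the text
SITES = {
    "SalI": "GTCGAC",
    "XbaI": "TCTAGA",
    "NcoI": "CCATGG",
    "BglII": "AGATCT",
    "XhoI": "CTCGAG",
    "SmaI": "CCCGGG",
}

def sites_in(seq):
    """Map each enzyme whose site occurs in seq to the 0-based index of its first occurrence."""
    seq = seq.upper()
    return {name: seq.find(site) for name, site in SITES.items() if site in seq}

dmcin1 = "acgcgtcgacTCAGCTATGAGATTACTCGTG"
dmcin4 = "tcccccgggctcgagagatctccatggTTTCTTCAGCTCTATGAATCC"

forward_sites = sites_in(dmcin1)
reverse_sites = sites_in(dmcin4)
# Order of sites along the reverse primer, read 5' to 3'
reverse_order = sorted(reverse_sites, key=reverse_sites.get)
```

Because DMCIN-4 is a reverse primer, the order of sites along the primer is the reverse of their order in the amplified product when read from the promoter side.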

The products obtained in round Two and Three were digested with XbaI and
subsequently ligated to reconstitute a 3.3 Kb long DMC1 promoter from which the first
two in-frame ATG start codons were replaced with a unique restriction site for XbaI.
This promoter can be cloned between the restriction sites for SalI and SmaI of p3264,
which contains the SacI-EcoRI NOS terminator in pBIN19, to yield the entire expression
cassette in pBIN19. This cassette contains the following cloning sites: NcoI, BglII, XhoI,
SmaI and (already present on p3264) KpnI and SacI.

(ii) Another strategy yielded the following convenient DMC1 promoter. A 1.8 kb
DNA fragment comprising the 3' terminal part of the meiocyte specific DMC1 promoter
was isolated by PCR from purified genomic DNA of Arabidopsis thaliana (ssp. Landsberg
erecta "Ler"). The forward PCR primer (DMC1a) contained the sequence
acgcgtcgacGAATTCGCAAGTGGGG (SEQ ID NO:40)
and introduced a SalI cloning site at the 5' end of the promoter fragment. The reverse
PCR primer (DMC1b) contained the sequence
tccatggagatctcccgggtacCGATTTGCTTCGAGGG (SEQ ID NO:41)
introducing a polylinker region at the 3' end of the promoter fragment. The PCR reaction
was carried out with VENT DNA Polymerase (NEB) over 25 cycles using the following
cycling protocol: 1 minute at 94°C, 1 minute at 54°C, 2 minutes at 72°C.

The PCR reaction yielded a blunt ended DNA fragment which was digested with
restriction enzyme SalI and was cloned into the cleavage sites of restriction enzymes SalI
and SmaI in plasmid p2030, a pUC118 derivative containing the SacI-EcoRI NOS
terminator fragment of pBIN121. The cloning yielded plasmid p2031, containing the
DMC1 promoter-polylinker-NOS terminator expression cassette depicted in Figure 18.

B. Construction of an MSH3 antisense gene under the control of the DMC1 promoter

A 2.1 kb DNA fragment encoding the carboxy terminal part of AtMSH3 was
removed from the polylinker of clone 13 described in Example 1 after (i) digestion with
KpnI, (ii) blunting of the DNA ends generated by KpnI and (iii) digestion with BamHI.
The isolated fragment was then cloned in antisense orientation downstream of the DMC1
promoter in plasmid p2031, which had been digested with restriction enzymes SmaI and
BglII. This cloning yielded plasmid p2033 (Figure 18).

After digestion of p2033 with EcoRI, a 4.1 kb DNA fragment was recovered
comprising the DMC1 promoter, the partial MSH3 cDNA in antisense orientation with
respect to the promoter and the NOS terminator. This fragment was cloned into the EcoRI
restriction site of plant transformation vector pNOS-Hyg-SCV to yield plasmid p3242
(Figure 18).

C. Construction of a combined MSH6/MSH3 antisense gene under the control of a
single DMC1 promoter

A 3.1 kb fragment, encoding in antisense orientation the partial AtMSH6 and AtMSH3
sequences and the 35S terminator, was isolated from pCW186 by digestion with KpnI.
The fragment was treated with Klenow enzyme to blunt both ends. It was then cloned into
the SmaI site of plasmid p3243 (a pNOS-Hyg-SCV derivative, illustrated in Figure 19),
which had been opened to delete the region between the SmaI sites. Clones containing the
fragment in the antisense orientation with respect to the DMC1 promoter (described in
A(ii) above) were identified by diagnostic digestion with BamHI. The selected construct
was named p3261.

Another practical way of cloning the double antisense gene is as follows. A 1 kb
DNA fragment encoding the carboxyterminal part of AtMSH6 is isolated from clone 62
described in Example 1 after digestion of clone 62 plasmid DNA with BamHI, which
cleaves in the 5' polylinker region flanking the partial cDNA, and with EcoRI, which
cleaves within the cDNA. The isolated fragment is treated with Klenow enzyme to blunt
both its ends and is cloned into the recipient plasmid p2033 or p3242. For the purpose of
cloning, the recipient plasmid may be cleaved with either AvaI or NcoI and can be blunted
with Klenow enzyme to produce blunt acceptor ends for fragment cloning. This cloning
yields two opposite orientations of cloned fragment DNA with respect to the DMC1
promoter. These can be identified by diagnostic digestion with restriction enzymes ScaI
or XmnI in conjunction with SacI. The selected construct contains the DMC1 promoter,
the combined partial cDNAs for AtMSH3 and AtMSH6 (both cloned in antisense
orientation with respect to the DMC1 promoter) and the NOS terminator. If the recipient
plasmid is p2033, the combined antisense gene under control of the single DMC1 promoter
is recovered from the vector after EcoRI digestion and is cloned into the EcoRI restriction
site of pNOS-Hyg-SCV.

D. Construction of a full-length MSH3 sense gene under control of the DMC1 promoter
for overexpression of functional MSH3 protein

Overexpression of MSH3 protein was shown in human cells (Marra et al., 1998,
Proc. Natl. Acad. Sci. USA 95, 8568-8573) to complex all available MSH2 protein. This
leaves MSH6 protein without its partner, leading to the degradation of MSH6 protein and
eventually to a mismatch repair phenotype.

This phenomenon is exploited to increase homeologous meiotic recombination in
Arabidopsis as an alternative to antisense inhibition of AtMSH6. For this purpose the
full-length cDNA encoding AtMSH3 is isolated from plasmid pPF66 by digestion with SmaI
and is cloned into the SmaI site of the DMC1 expression cassettes described in A(i).

E. Selection of recombination markers on homeologous chromosomes of Arabidopsis
thaliana subspecies Landsberg erecta (Ler), Columbia (Col) and C24, respectively

E(i). Visual recombination markers in Arabidopsis th. subspecies C24:

Agrobacterium mediated transformation with a T-DNA containing a 35S-GUS gene,
inactivated by insertion of a 35S-Ac transposable element (Finnegan et al., 1993, Plant
Mol. Biol. 22, 625-633), had yielded a C24 line in which the T-DNA construct was
integrated into chromosome 2. Genetic and molecular analysis of this line shows that the
Ac transposon had excised from its T-DNA locus thereby restoring GUS activity, but had
re-inserted into the chromosome at a distance of 16.4 cM, where it stayed fixed (due to
disablement of Ac) within the chlorina gene. Insertional inactivation of the chlorina gene
caused a bleached phenotype in those plants that were homozygous for this mutation.
Because of the two linked phenotypic markers, chlorina3A:Ac and GUS, this C24 line was
used in crosses to wildtype Ler for analysis of meiotic homeologous recombination on
chromosome 2 in conjunction with molecular recombination markers.

E(ii). Visual recombination markers in Arabidopsis th. Ler:

The Ler line NW1 (obtained from NASC, Nottingham, UK) contains one recessive
visual marker per chromosome, i.e. an-1 on Chr.1, py-1 on Chr.2, gl1-1 on Chr.3, cer2-1
on Chr.4, and ms1-1 on Chr.5. This line is used in crosses to wildtype C24 which
expresses an MMR altering gene for analysis of meiotic homeologous recombination on
chromosomes 1-5 in conjunction with molecular recombination markers listed in Table 1.

Other Ler lines from NASC have several visual markers in close proximity to each
other on the same chromosome. When these lines are used for hybrid production, analysis
of homeologous meiotic recombination may make use entirely of visual recombination
markers. Particularly suitable for crossing to C24 wildtype that is expressing an MMR
altering gene are the following Ler lines:

NW22: relevant markers are dis1 - (4 cM) - - (11 cM) - th1 on chromosome 1.
NW10: relevant markers are tz-201 - (5 cM) - cer3 on chromosome 5.
NW134: relevant markers are ttg - (4 cM) - ga3 on chromosome 5.

NW24 (abi3-1) and NW64 (gl1-1). When present in the same plant on chromosome
3, abi3-1 and gl1-1 are 13 cM apart. Since this marker combination is not available from
NASC, we have combined these markers by crossing line NW24 to line NW64. The F1
offspring were allowed to self-fertilise and to produce F2 seeds. F2 plants which carry
both markers as homozygous traits on the same chromosome can be identified firstly, by
germinating F2 seeds on germination medium containing selective concentrations of
abscisic acid, and subsequently, by identifying among the abscisic acid resistant plants
those individuals which show the glabra phenotype.

E(iii). Molecular recombination markers in Col, Ler and C24:

The genome of Arabidopsis thaliana is interspersed with unique base sequences
arranged as simple tandem repeats. Allelic repeats can vary in length between different
Arabidopsis subspecies and when amplified by PCR yield diagnostic DNA products of
different length named Simple Sequence Length Polymorphisms (SSLPs). Many SSLPs
have been genetically mapped and have been assigned to unique chromosome locations on
the recombinant inbred map (Bell and Ecker, 1994, Genomics 19, 137-144; Lister and
Dean's lines, Weeds World 4i, May 1997).

In Table 1 are listed 28 mapped and established SSLPs between Ler and Col. A
number of PCR primer pairs are described herein (SEQ ID NO:42 to SEQ ID NO:97)
which also yielded SSLPs between C24 and Ler (19 SSLPs) or between C24 and Col (25
SSLPs), respectively. Polymorphic SSLPs can be used as molecular markers in the
analysis of homeologous recombination between genomes from these subspecies.
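
How a polymorphic SSLP serves as a recombination marker can be sketched in a few lines. The example below is an illustration only: the product lengths are invented rather than taken from Table 1, the `tolerance` parameter and both function names are hypothetical, and each scored chromosome is assumed to have been genotyped at two flanking markers.

```python
def call_allele(product_bp, parent_lengths, tolerance=2):
    """Assign a PCR product to the parent whose SSLP length it matches, else None."""
    for parent, length in parent_lengths.items():
        if abs(product_bp - length) <= tolerance:
            return parent
    return None

def recombinant_fraction(chromosomes):
    """Fraction of scored chromosomes whose two flanking markers come from different parents.

    chromosomes is a list of (marker_1_parent, marker_2_parent); unscored entries are skipped.
    """
    scored = [(a, b) for a, b in chromosomes if a and b]
    if not scored:
        raise ValueError("no chromosome was scored at both markers")
    return sum(a != b for a, b in scored) / len(scored)

# Hypothetical SSLP product lengths (bp) for one marker
lengths = {"Ler": 150, "C24": 162}
calls = [call_allele(bp, lengths) for bp in (150, 161, 156)]

# Hypothetical two-marker genotypes for four scored chromosomes and one failed reaction
fraction = recombinant_fraction(
    [("Ler", "Ler"), ("Ler", "C24"), ("C24", "C24"), ("C24", "Ler"), (None, "Ler")]
)
```

A product that matches neither parental length is left uncalled rather than forced into a class, so that ambiguous bands do not contribute spurious recombinants.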

The PCR reactions (25 µL) were carried out over 35 cycles (15 seconds at 94°C, 30
seconds at 55°C and 30 seconds at 72°C), with 0.25 U Taq DNA polymerase and 0.6
[unit illegible] genomic DNA in reaction buffer containing 2 mM MgCl2. PCR products
were separated by agarose gel electrophoresis (4% ultra high resolution agarose) and
visualised by ethidium bromide staining. The results from the PCR experiments are
summarised in Table I, which also shows the sequence of PCR primers, primer annealing
temperature (Tm), PCR product length and chromosome location of each SSLP (with
respect to the RI map of May 1997, Weeds World 4i).
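
The way SSLP product lengths are read as markers can be illustrated with a small sketch: a marker is informative for a pair of accessions only if the PCR products differ in length. The product lengths, the function name and the `min_difference_bp` parameter below are hypothetical and are not taken from Table I.

```python
from itertools import combinations


def polymorphic_pairs(product_lengths_bp, min_difference_bp=1):
    """Return accession pairs whose PCR products differ in length.

    product_lengths_bp maps an accession name to its product length in bp.
    min_difference_bp is a hypothetical stand-in for the smallest length
    difference that can be resolved on the gel.
    """
    pairs = []
    for first, second in combinations(sorted(product_lengths_bp), 2):
        difference = abs(product_lengths_bp[first] - product_lengths_bp[second])
        if difference >= min_difference_bp:
            pairs.append((first, second))
    return pairs


# Hypothetical product lengths for a single SSLP marker.
example_marker = {"Col": 151, "Ler": 130, "C24": 151}
print(polymorphic_pairs(example_marker))
```

With these made-up lengths the marker distinguishes Ler from both Col and C24 but not Col from C24, mirroring the observation that only a subset of the Ler/Col SSLPs is also polymorphic against C24.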

F. Production of hybrid plants

C24 plants heterozygous for chlorina3A:Ac/GUS are crossed as male to emasculated
wildtype Ler to produce Ler/C24(chlorina3A, GUS) hybrid seeds.

Due to the heterozygosity of the C24 parent, only 50% of hybrid plants have
inherited the chlorina3A:Ac/GUS locus. The remaining 50% of hybrid plants are wildtype
with respect to chlorina3A:Ac/GUS. Since the mutant locus is linked to a kanamycin
resistance gene (contained on the same T-DNA as GUS), mutant plants can be pre-selected
by germinating hybrid seeds on germination medium containing 50 mg/L kanamycin.

Ler plants homozygous for the five chromosome markers are male sterile (ms1-1)
and are crossed without emasculation to wildtype C24 to produce Ler(an-1, py-1, gl1-1,
cer2-1, ms1-1)/C24 hybrid seeds.

Other Ler plants, which are male fertile, are crossed after emasculation of the female
parent to produce Ler/C24 hybrid seeds.

G. Introduction of MSH3 and MSH6 antisense genes into Arabidopsis and analysis of
meiotic homeologous recombination

(i) Transformation of hybrid plants and analysis of homeologous meiotic recombination

The plant transformation vectors comprising the antisense genes described in (B) and
(C) above are introduced into Agrobacterium tumefaciens strain AGL1 (Lazo et al., 1991,
Bio/Technology 9, 963-967) by electroporation. Recombinant Agrobacterium clones are
selected on LB medium containing 50 mg/L rifampicin and 100 mg/L carbenicillin.
Selected clones are used to infect roots of Arabidopsis hybrid plants (described in (F)
above) using the root transformation protocol of Valvekens et al. (1988, PNAS 85, 5536-
5540) except that shoot and root inducing media contain hygromycin (10 mg/L) instead of
kanamycin.

Plants regenerated from roots of hybrid plants are genetic clones of the root donating
plants and therefore are again genetic hybrids of the two Arabidopsis subspecies described
in (F). However, in contrast to the root donating plants, the regenerated hybrid plants also
contain the introduced transgene and the co-introduced hygromycin resistance gene, with
the latter allowing these plants to grow on hygromycin containing culture medium.

Hygromycin resistant plants are then allowed to enter the reproductive phase and to
produce gametes by meiotic divisions of microspore and megaspore mother cells. At
meiosis, the DMC1 promoter is activated and can direct the expression of the antisense
genes described in (B) and (C) above, leading to decreased MSH6 and/or MSH3 gene
expression. This in turn depletes the gamete mother cells of MSH6 and/or MSH3 protein,
thus causing alteration of MMR during meiotic divisions and increasing the recombination
frequency between homeologous chromosomes.

Transgenic plants are then allowed to self-fertilise and to produce seeds. These
seeds (F2 seeds with respect to hybrid production), and the plants derived therefrom,
carry the homeologous recombination events which can be identified by using the visual
and molecular recombination markers described in (E) above.

In case of homeologous recombination between chromosomes of Ler and
C24(chlorina3A:Ac, GUS), the analysis concentrates on chromosome 2 by selecting plants
showing the visual phenotypic marker chlorina. This marker thus serves as a reference
point as it indicates that the respective chromosomes 2 originate from C24. Other markers,
such as GUS or molecular markers, on chromosome 2 may then be used to identify
chromosomal regions which are derived from the Ler chromosome as a result of
homeologous recombination. F2 plants of control transformants not expressing the
antisense gene(s) can be analysed in parallel and the results can be used for comparison to
homeologous recombination results obtained in antisense plants.

(ii) Transformation of C24 wildtype, hybrid plant production and analysis of homeologous
meiotic recombination

Introduction of MMR altering genes into wildtype C24 is done using the root
transformation protocol as described in G(i) for transformation of hybrid plants.
Transformed plants are selected by resistance to either 10 mg/L hygromycin (in case of
transformation with T-DNAs derived from pNOS-Hyg-SCV) or to 50 mg/L kanamycin
(in case of transformation with T-DNAs derived from pBIN19).

Transgenic plants are then allowed to self-fertilise and to produce seeds (T1 seeds).
Segregation of the antibiotic resistance gene in the T1 population then indicates the
number of transgene loci. Lines with a single transgene locus (indicated by a 3:1 ratio of
resistant:sensitive plants) are selected and are bred to homozygosity. This is done by
collecting selfed seeds (T2) from T1 plants and subsequent testing of at least four
independent T2 seed populations for segregation of the antibiotic resistance gene. T2
populations which do not segregate the antibiotic resistance gene are assumed to be
homozygous for both the resistance gene and the linked MMR altering gene.
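
Whether observed counts of resistant and sensitive seedlings are compatible with the 3:1 ratio expected for a single transgene locus can be checked with a chi-square goodness-of-fit test. The following is a minimal sketch; the seedling counts and function names are hypothetical, and the choice of a 5% significance level is an assumption rather than something stated in the text.

```python
# Critical value of the chi-square distribution for 1 degree of freedom at p = 0.05.
CHI_SQUARE_CRITICAL_DF1_P05 = 3.841


def chi_square_3_to_1(resistant: int, sensitive: int) -> float:
    """Chi-square statistic for observed counts against a 3:1 expectation."""
    total = resistant + sensitive
    if total <= 0 or resistant < 0 or sensitive < 0:
        raise ValueError("counts must be non-negative and not both zero")
    expected_resistant = 0.75 * total
    expected_sensitive = 0.25 * total
    return ((resistant - expected_resistant) ** 2 / expected_resistant
            + (sensitive - expected_sensitive) ** 2 / expected_sensitive)


def consistent_with_single_locus(resistant: int, sensitive: int) -> bool:
    """True if the counts do not deviate significantly from 3:1 (5% level)."""
    return chi_square_3_to_1(resistant, sensitive) < CHI_SQUARE_CRITICAL_DF1_P05


# Hypothetical T1 seedling counts on selective medium.
for resistant, sensitive in [(152, 48), (188, 12)]:
    statistic = chi_square_3_to_1(resistant, sensitive)
    verdict = consistent_with_single_locus(resistant, sensitive)
    print(f"{resistant}:{sensitive} chi2 = {statistic:.2f}, single locus plausible: {verdict}")
```

A population that fits 3:1 is compatible with one locus; a strong excess of resistant seedlings, as in the second hypothetical count, would instead suggest more than one unlinked transgene locus.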

C24 plants homozygous for the MMR altering gene are then crossed to Ler lines
homozygous for recessive visual markers (see E(ii)) to produce C24/Ler hybrid plants as
described in (F). These F1 hybrids are then allowed to enter the reproductive phase and to
produce gametes by meiotic division of microspore and megaspore mother cells. At
meiosis, the DMC1 promoter is activated and can direct the expression of the antisense or
sense genes described in (B), (C) and (D) above, leading to decreased MSH6 and/or MSH3
gene expression. This in turn depletes the gamete mother cells of MSH6 and/or MSH3
protein, thus causing alteration of MMR during meiotic divisions and increasing the
recombination frequency between the homeologous chromosomes of C24 and Ler.
Recombination events are then scored in the F2 generation.

For recombination analysis, the hybrid plants are allowed to self-fertilise and to
produce F2 seeds. F2 plants are then preselected for a first visual marker. Since this
marker is recessive, its visual presence indicates homozygosity for Ler DNA at the
relevant locus. Those F2 plants which show this first visual marker are then analysed for
the presence or absence of a second visual marker which in the Ler parent is closely linked
to the first marker. Absence of the second visual marker indicates recombination between
the relevant C24 and Ler chromosomes between the first and second marker. The
frequency of recombination in transgenic hybrids is compared to the recombination
frequency in control hybrids not expressing the MMR altering gene.
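
The scoring scheme above can be turned into a per-gamete recombination estimate. A preselected F2 plant is homozygous for the first recessive marker, so both of its gametes carried the Ler allele at that locus; it still shows the second recessive marker only if neither gamete was recombinant. Writing f for the fraction of preselected plants that lack the second marker and r for the recombination fraction, f = 1 - (1 - r)^2, hence r = 1 - sqrt(1 - f). The sketch below assumes equal recombination in male and female meiosis and no viability differences; these assumptions, the function names and the plant counts are illustrative and not taken from the specification.

```python
import math


def recombination_fraction(preselected: int, lacking_second_marker: int) -> float:
    """Estimate the per-gamete recombination fraction between two linked markers.

    preselected: F2 plants showing the first recessive marker.
    lacking_second_marker: those of them that do not show the second marker.
    Uses f = 1 - (1 - r)**2, i.e. r = 1 - sqrt(1 - f).
    """
    if preselected <= 0 or not 0 <= lacking_second_marker <= preselected:
        raise ValueError("counts are inconsistent")
    f = lacking_second_marker / preselected
    return 1.0 - math.sqrt(1.0 - f)


def fold_change(transgenic_r: float, control_r: float) -> float:
    """Ratio of recombination fractions, transgenic hybrid over control hybrid."""
    if control_r <= 0:
        raise ValueError("control recombination fraction must be positive")
    return transgenic_r / control_r


# Hypothetical counts for a transgenic hybrid and a control hybrid.
r_transgenic = recombination_fraction(preselected=400, lacking_second_marker=60)
r_control = recombination_fraction(preselected=400, lacking_second_marker=20)
print(f"transgenic r = {r_transgenic:.4f}, control r = {r_control:.4f}")
print(f"fold change = {fold_change(r_transgenic, r_control):.2f}")
```

Comparing the two estimates, rather than the raw plant fractions, keeps the comparison on the same per-meiosis scale as the map distances given for the marker lines.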

Examples of recombination analysis are the following. 

The Ler line NW22 (dis1, ga4, th1) is used for crosses to transformed C24.
F2 plants are preselected first for thiamine requirement (th1) and then are further
analysed for re-appearance of wildtype height (loss of ga4) and/or re-appearance of
normal trichomes (loss of dis1) as a result of recombination.

The Ler line NW10 (tz-201, cer3) is used for crosses to transformed C24.
F2 plants are then preselected first for thiazole requirement (tz) and then are further
analysed for re-appearance of normal, i.e. non-shiny stems (loss of cer3) as a result of
recombination.

The Ler line NW134 (ttg, ga3) is used for crosses to transformed C24. F2 plants
are first preselected for dwarfish appearance (ga3) and are then analysed for re-appearance
of trichomes (loss of ttg) as a result of recombination.

Ler plants homozygous for abi3-1 and gl1-1 are used for crosses to transformed C24.
F2 plants are first preselected for their ability to germinate in the presence of abscisic acid
and are then analysed for re-appearance of trichomes on the leaves (loss of gl1-1) as a
result of recombination.

In the case of homeologous recombination between transformed C24 and the Ler line
NW1 (an-1, py-1, gl1-1, cer2-1, ms1-1), recombination analysis is similar to the one
described above, except that the second marker is not a visual marker but has to be a
molecular marker. This is because the Ler parent carries only one visual marker per
chromosome.

TABLE I: SSLP Markers in Arabidopsis thaliana

Primer Sequence

GCCACTGCGTGAATGATATG
CGAACAGCCAACATTAATTCCC

AACCAAGGCACAGAAGCG
ACCCAAGTGATCGCCACC

TACCGAACCAAAACACAAAGG
TCTGTATCTCGGTGAATTCTCC

GGTCTGTTGATGTCGTAAGTCG
ATCTTGAAACCTTTAGGGAGGG

CTGATCTCACGGACAATAGTGC
GGCTCCATAAAAAGTGCACC

CTCCAGTTGGAAGCTAAAGGG
TGTTTTTTAGGACAAATGGCG

CCTTCACATCCAAAACCCAC

CLAIMS

1. An isolated and purified DNA molecule comprising a polynucleotide sequence
encoding a polypeptide functionally involved in the DNA mismatch repair system of a
plant.

2. A DNA molecule according to claim 1 wherein said polypeptide is
homologous to a mismatch repair polypeptide of a yeast or of a human.

3. A DNA molecule according to claim 1 wherein said polypeptide is
homologous to AtMSH3 (SEQ ID NO: 19) or to AtMSH6 (SEQ ID NO: 31).

4. An isolated and purified polypeptide functionally involved in the DNA
mismatch repair system of a plant.

5. A polypeptide according to claim 4 which is homologous to a mismatch repair
polypeptide of a yeast or of a human.

6. An isolated and purified polypeptide selected from the group consisting of a
polypeptide encoded by the gene AtMSH3 (SEQ ID NO: 18), a polypeptide encoded by the
gene AtMSH6 (SEQ ID NO: 30), polypeptides homologous to a polypeptide encoded by the
gene AtMSH3 (SEQ ID NO: 18) and polypeptides homologous to a polypeptide encoded
by the gene AtMSH6 (SEQ ID NO: 30).

7. An isolated and purified DNA molecule comprising a polynucleotide sequence
selected from the group consisting of (i) a sequence encoding a polynucleotide which is
capable of interfering with the expression of a plant polynucleotide sequence encoding a
polypeptide which is homologous to a mismatch repair polypeptide of a yeast or of a
human and thereby disabling said plant polynucleotide sequence; and (ii) a sequence
encoding a polypeptide capable of disrupting the DNA mismatch repair system of a plant.

8. A DNA molecule according to claim 7 comprising a polynucleotide sequence
encoding a polynucleotide capable of interfering with the expression of a plant
polynucleotide sequence encoding a polypeptide which is homologous to a mismatch
repair polypeptide of a yeast or of a human and thereby disabling said plant polynucleotide
sequence.

9. A DNA molecule according to claim 8 wherein said polynucleotide capable
of interfering with the expression of a plant polynucleotide sequence is a sense
polynucleotide, an antisense polynucleotide or a ribozyme.

10. A DNA molecule according to claim 7 comprising a polynucleotide sequence
encoding a polypeptide capable of disrupting the DNA mismatch repair system of a plant.

11. A DNA molecule according to claim 10 wherein said polypeptide is
homologous to AtMSH3 (SEQ ID NO: 19) or to AtMSH6 (SEQ ID NO: 31).

12. A DNA molecule according to claim 10 further comprising a regulation
element capable of causing overexpression of said polypeptide in a cell of said plant.

13. A chimeric gene comprising:

a DNA sequence selected from the group consisting of (i) a sequence encoding a
polynucleotide capable of interfering with the expression of a plant polynucleotide
sequence encoding a polypeptide which is homologous to a mismatch repair polypeptide of
a yeast or of a human and thereby disabling said plant polynucleotide sequence, and (ii) a
sequence encoding a polypeptide capable of disrupting the DNA mismatch repair system
of a plant; and

at least one regulation element capable of functioning in a plant cell.

14. A chimeric gene according to claim 13 wherein said regulation element is
selected from constitutive, inducible, tissue type specific and cell type specific promoters.

15. A chimeric gene according to claim 13 comprising a DNA sequence encoding
a polypeptide capable of disrupting the DNA mismatch repair system of a plant, wherein
said regulation element is capable of causing overexpression of said polypeptide in a cell
of said plant.

16. A chimeric gene according to claim 13 wherein said regulation element is
selected from the group consisting of 35S, NOS, PR1a, AoPR1 and DMC1.

17. A plasmid or vector comprising a chimeric gene according to any one of
claims 13-16.

18. A plant cell stably transformed, transfected or electroporated with a plasmid or
vector according to claim 17.

19. A plant comprising a cell according to claim 18.

20. A plant according to claim 19 selected from plants of the families
Brassicaceae, Poaceae, Solanaceae, Asteraceae, Malvaceae, Fabaceae, Linaceae,
Cannabinaceae, Daucaceae and Cucurbitaceae.

21. A process for at least partially inactivating a DNA mismatch repair system of a
plant cell, comprising transforming or transfecting said plant cell with a DNA molecule
according to any one of claims 1-3 or 7-12 and causing said DNA sequence to express
said polynucleotide or said polypeptide.

22. A process for at least partially inactivating a DNA mismatch repair system of a
plant cell, comprising transforming or transfecting said plant cell with a chimeric gene
according to any one of claims 13-16 and causing said DNA sequence to express said
polynucleotide or said polypeptide.

23. A process for at least partially inactivating a DNA mismatch repair system of a
plant cell, comprising transforming or transfecting said plant cell with a plasmid or vector
according to claim 17 and causing said DNA sequence to express said polynucleotide or
said polypeptide.

24. A process for increasing genetic variation in a plant comprising obtaining a
hybrid plant from a first plant and a second plant, or cells thereof, said first and second
plants being genetically different; altering the mismatch repair system in said hybrid plant;
permitting said hybrid plant to self-fertilise and produce offspring plants; and screening
said offspring plants for plants in which homeologous recombination has occurred.

25. A process according to claim 24 wherein a first gene is incapacitated in said
first plant, a second gene is incapacitated in said second plant, and said first and second
genes are incapacitated in said hybrid plant thereby altering the mismatch repair system of
said hybrid plant.

25. A process according to claim 25 wherein said incapacitation of the mismatch
repair system of said hybrid plant is reversible.

26. A process according to claim 24 wherein a new genetic linkage of a desired
characteristic trait or of a gene which contributes to a desired characteristic trait is
observable in at least one of said offspring plants.

27. A process for obtaining a plant having a desired characteristic, comprising
altering the mismatch repair system in a plant, cell or plurality of cells of a plant which
does not have said desired characteristic, permitting mutations to persist in said cells to
produce mutated plant cells, deriving plants from said mutated plant cells, and screening
said plants for a plant having said desired characteristic.

28. A process according to claim 27 wherein said step of altering the mismatch
repair system comprises introducing into said hybrid plant, plant, cell or cells a chimeric
gene according to claim 13 and permitting the chimeric gene to express a polynucleotide
which is capable of interfering with the expression of a plant polynucleotide sequence in a
mismatch repair gene of the hybrid plant, plant, cell or cells, or a polypeptide capable of
disrupting the DNA mismatch repair system of the hybrid plant, cell or cells.

29. A process according to claim 28 comprising inactivating an MSH3 gene and/or
an MSH6 gene of said plant.

30. A process according to claim 28 comprising inactivating an MSH3 gene and an
MSH6 gene of said plant.



WO 99/19492    PCT/EP98/06977



31. A process according to claim 27 comprising at least partially inactivating the
mismatch repair system of said plant in a predetermined cell type or in a predetermined
tissue of said plant.

32. A process according to claim 31 further comprising restoring mismatch repair
in said cell type or said tissue.

33. An oligonucleotide capable of hybridising at 45°C under standard PCR
conditions to a DNA molecule according to claim 1 with the proviso that said
oligonucleotide is other than SEQ ID NO: 1 or SEQ ID NO: 2.

34. An oligonucleotide capable of hybridising at 45°C under standard PCR
conditions to the DNA of SEQ ID NO: 18 with the proviso that said oligonucleotide is
other than SEQ ID NO: 1 or SEQ ID NO: 2.

35. An oligonucleotide capable of hybridising at 45°C under standard PCR
conditions to the DNA of SEQ ID NO: 30 with the proviso that said oligonucleotide is
other than SEQ ID NO: 1 or SEQ ID NO: 2.

Figure 2

Comments/References: 52 = 3' side of S5 (AtMSH3) 1244 bp in pUC18/Sma1

Figure 3

Comments/References: 13 = 3' side of S5 (AtMSH3) 2104 bp in pUC18/Sma1







Figure 7 



Comments/References: 43 = 5' side of S8 (AtMSH6), 2182 bp, in pUC18/Sma1

Figure 8

Comments/References: 62 = 3' side of S8 (AtMSH6), 1379 bp, in pUC18/Sma1
TTTTTTGGTTGCTAACAATAAAGGTATACGGTTTTATGTCATCAATATAA 50

CTATATATAAAAGAAATGAAAGATATATATTGTTTTTTCATTTATCAAAC 100 

AAAACAACAAGACTTTTTTTTTACTTTTTACATTGGTCAACAAAATACAA 150 

GATAAACGACATCGTTTAATCATTTCCCAATTTTACCCCTAAGTTTAACA 200 

CCTAGAACCTTCTCCATCTTCGCAAGCACAGCCTGATTAGGAACAGCTTT 250 

ACCATTCTCATATTCCTGAACTACCTGAGTCCTCTCATTGATCTGTTTCG 300

CCAAATCCGCTTGTGACATCTTCTTCTCCAATCTCGCTTTCTGTATCATC 350 

AACCTCACCTCTGCTTTCACACGATCCATCGCCGCAGGCTCTGTTTCTTC 400 

TTCCAGCTTCTTCGTGTTAATCACCGGAACCGCCGTAGATTTCCCCTTTT 450 

TGTTCGAACCGGCATCGAATTTCTTAACCGTTTGAACCGCGACACCGTTT 500 

CTCAGAGCTGCGTTAACCGCTTTCGGATCGCGTAGGTCTTGGCTCTTTTG 550 

TTTTGATTTGTGGAGAACTACTGGTTCCCAGTCTTGTGTTACTGCTCCTG 600 

GGTATCTGCTCGGCATCGTCGATGAATTGAGAGAAAGGAACAACGCGAAA 650 

ATTTTATTAATCTGAGTTTTGAAATTGAGAAACGATGAAGATGAAGAATG 700 

TTGTTGAGAGGATTGTGATATTTATATATACGAAGATTGGTTTCTGGAGA 750 

ATTCGATCATCTTTTTCTCCATTTTCGTCTCTGGAACGTTCTTAGAGATG 800 

ATTGACGACGTGTCATTATCTGATTTGCAGTTAACCAATGCTTTTTGGGT 850

TGGATTCGTGGTACACCATATTATCCGATTTGGCTCAATGGTTTTATATA 900 

AATTTGGTTTTCGGTTCGGTTATGAGTTATCATTAAAATTAAGCTAACCA 950 

AAAATTTTCGTAAAATTTATTTCGGTTTCAATTCGGATCCCTTACTTCCA 1000

GAACCGAATTATTCGAAACCGGGGTTAGCCGAACCGAATACCAATGCCTG 1050

ATTGACTCGTTGGCTAGAAAGATCCAACGGTATACAATAATAGAACATAA 1100 

ATCGGACGGTCATCAAAGCCTCAAAGAGTGAACAGTCAACAAAAAAAGTT 1150 

GAGCCCTGAGGAGTATCGTTTCCGCCATTTCTACGACGCAAGGCGAAAAT 1200 

TTTTGGCGCCAATCTTTCCCCCCTTTCGAATTCTCTCAGwTCAAAACATC 1250 

GTTTCTCTCTCACTCTCTCTCACAATTCCAAAAAATGCAGCGCCAGAGAT 1300 

CGATTTTGTCTTTCTTCCAAAAACCCACGGCGGCGACTACGAAGGGTTTG 1350 

GTTTCCGGCGATGCTGCTAGCGGCGGGGGCGGCAGCGGAGACCACGATTT 1400 

AATGTGAAGGAAGGGGATGCTAAAGGCGACGCTTCTGTACGTTTTGCTGT 1450 

TTCGAAATCTGTCGATGAGGTTAGAGGAACGGATACTCCACCGGAGAAGG 1500 

TTCCGCGTCGTGTCCTGCCGTCTGGATTTAAGCCGGCTGAATCCGCCGGT 1550 

GATGCTTCGTCCCTGTTCTCCAATATTATGCATAAGTTTGTAAAAGTCGA 1600 

TGATCGAGATTGTTCTGGAGAGAGGTACTAATCTTCGATTCTCTTAATTT 1650 

TGTTATCTTTAGCTGGAAGAAGAAGATTCGTGTAATTTGTTGTATTCGTT 1700 

GGAGAGATTCTGATTACTGCATTGGATCGTTGTTTACAAATTTTCAGGAG 1750 

CCGAGAAGATGTTGTTCCGCTGAATGATTCATCTCTATGTATGAAGGCTA 1800

ATGATGTTATTCCTCAATTTCGTTCCAATAATGGTAAAACTCAAGAAAGA 1850 

AACCATGCTTTTAGTTTCAGTGGGAGAGCTGAACTTAGATCAGTAGAAGA 1900

TATAGGAGTAGATGGCGATGTTCCTGGTCCAGAAACACCAGGGATGCGTC 1950 

CACGTGCTTCTCGCTTGAAGCGAGTTCTGGAGGATGAAATGACTTTTAAG 2000 

GAGGATAAGGTTCCTGTATTGGACTCTAACAAAAGGCTGAAAATGCTCCA 2050 

GGATCCGGTTTGTGGAGAGAAGAAAGAAGTAAACGAAGGAACCAAATTTG 2100 

AATGGCTTGAGTCTTCTCGAATCAGGGATGCCAATAGAAGACGTCCTGAT 2150 

GATCCCCTTTACGATAGAAAGACCTTACACATACCACCTGATGTTTTCAA 2200 

Figure 11 
GAAAATGTCTGCATCACAAAAGCAATATTGGAGTGTTAAGAGTGAATATA
TGGACATTGTGCTTTTCTTTAAAGTGGTTAGTAACTATTAATCTAGTGTT 
CAATCCATTTCCTCAATGTGATTTGTTCACTTACATCTGTTTACGTTATG 
CTCTTCTCAGGGGAAATTTTATGAGCTGTATGAGCTAGATGCGGAATTAG 
GTCACAAGGAGCTTGACTGGAAGATGACCATGAGTGGTGTGGGAAAATGC 
AGACAGGTAAATTAGTTGAAACAACTGGCCTGCTTGAATTATTGTGTCTA 
TAAATTTTGACACCACCTTTTGTTTCAGGTTGGTATCTCTGAAAGTGGGA 
TAGATGAGGCAGTGCAAAAGCTATTAGCTCGTGGGTAAGGGAACCATCAT 
ACTTTATGGAATTCGTTTACTGCTACTTCGGCTAGGATTTAAGAAATGGA 
AATCACTTCAAGCATCATTAGTTAGGATCCTGAGAACTCAGGATGTTTTC 
TTATTCGTTATATAATAAGTCTTTTCATCAAGGAGTAACAAACAAAACTT 
GCACAATATTTGTGTGCTCACTGGCAAGGCATATATACCCAGCTAACCTT 
TGCTAGTTCACTGTAGTAACAGTTACGGATAATATATGTTTACTTGTATG
TGGTACCCTCATTTTGTCTCTCATGGAGGCTTTGAAGCCTTGTGTTGAAA
CTGGATAGTTACATATGCTTCCAACAGAAACTAGCATGCAGATTCATATG
CTTTCCTATTCTACTAATTATGTATTGACACACTCGTTGTTTCTTTTGAA
AGATATAAAGTTGGACGAATCGAGCAGCTAGAAACATCTGACCAAGCAAA
AGCCAGAGGTGCTAATACTGTAAGTTTTCTTGGATAGGTCAAGGAGAGTG
TTGCAGACTGTTTTTGATCATTTCTTTTTCTGTACATTACTTTCATGCTG
TAATTAACTCAATGGCTATTCTGGTCTGATTATCAGATAATTCCAAGGAA
GCTAGTTCAGGTATTAACTCCATCAACAGCAAGCGAGGGAAACATCGGGC
CTGATGCCGTCCATCTTCTTGCTATAAAAGAGGTTTGTTATTTACTTATT
TATCTTATCATGTTCAGTTCATCCAAGTCCTGAAAAATTACACTCTTCTT
TACCAATCTTCCATCAAGCTGTGTAAAGGATTTGGAATTAGAAAATCATT
ATTTGATGCTTTGTTTTATATGCAAGAGGTTCCCTTGAAAAGATCTGTTT
AAGATTCTTTGCACTTGAAAAATTCAATCTTTTTAAGTGAATCCCCTACT
TTCTTACAATGATCATAGTCTGCAATTGCATGTCAAGTAATATCATTCCT
TGTTACTGCATCCCCCTCTTTCTTAATGACCATTGTCTATGTTGTGTTTG
TCTCGTGTGCTGGAGAAAATGATAGCTGATCCAAGCTGTACATTATCATG
ATTAAGTAGCTGCTCAGGAATTGCCTTTGGTTACATTGCCTAATGGTTTG
ATGTCAATTTTTCTTCTGAATCTTTATTTTAGATCAAAATGGAGCTACAA
AAGTGTTCAACTGTGTATGGATTTGCTTTTGTTGACTGTGCTGCCTTGAG
GTTTTGGGTTGGGTCCATCAGCGATGATGCATCATGTGCTGCTCTTGGAG
CGTTATTGATGCAGGTAAGCAAGTGTATTCTGTATCTTATGTGTACCATG
TGACTTCCTGTGCATATATTTGGGTTGCAGGAACTAATTCTGAATCACCA
TTTGGTATGTTTTTTCCAGGTTTCTCCAAAGGAAGTGTTATATGACAGTA
AAGGTAAACTGCTTGTATCGCCAGTTGTTTTGTTAAACAGAATTTAAGGT
AAATGACACTGGTTAATTTAAAGTGCATACATGTTGAAATATTGCAGGGC
TATCAAGAGAAGCACAAAAGGCTCTAAGGAAATATACGTTGACAGGTACC
ATTTCAGTAGGCAAGCTAACTGACAATTTAACCGCTCACCGAATGATAGG
TCTCTTAAACATTGCTAATGTAGATGATGTTTATGTTTCAATCTAATAGG
GTCTACGGCGGTACAGTTGGCTCCAGTACCACAAGTAATGGGGGATACAG
ATGCTGCTGGAGTTAGAAATATAATAGAATCTAACGGATACTTTAAAGGT
TCTTCTGAATCATGGAACTGTGCTGTTGATGGTCTAAATGAATGTGATGT

Figure 11 (Continued)
TGCCCTTAGTGCTCTTGGAGAGCTAATTAATCATCTGTCTAGGCTAAAGG
TGTGTTGGCTTGTTTAGTTTTTGCTTTTCACAAATTAAGCAAAGGAACTT 
TTCATAACTTACAGTTTCTATCTACTTGCAGCTAGAAGATGTACTTAAGC 
ATGGGGATATTTTTCCATACCAAGTTTACAGGGGTTGTCTCAGAATTGAT 
GGCCAGACGATGGTAAATCTTGAGATATTTAACAATAGCTGTGATGGTGG 
TCCTTCAGGCAAGTGCATATTTCTTTTTTGATAACTTCAACTAGAGGGCA 
GACATAGAAGGAAAAATTCTAATACTTCGTACGGATCTCCAGTAAGTAAT 
AGCCGATTTTTGTTTACCTATGTAGGGACCTTGTACAAATATCTTGATAA 
CTGTGTTAGTCCAACTGGTAAGCGACTCTTAAGGAATTGGATCTGCCATC 
CACTCAAAGATGTAGAAAGCATCAATAAACGGCTTGATGTAGTTGAAGAA 
TTCACGGCAAACTCAGAAAGTATGCAAATCACTGGCCAGTATCTCCACAA 
ACTTCCAGACTTAGAAAGACTGCTCGGACGCATCAAGTCTAGCGTTCGAT 
CATCAGCCTCTGTGTTGCCTGCTCTTCTGGGGAAAAAAGTGCTGAAACAA 
CGAGTAAGTATCAATCACAAGTTTTCTGAGTAATGCCTTCCATGAGTAGT 
ATAGGACTAAAACATTACGGGTCTAGCTAAAGACTGTTCTCCTTCTTTTG 
CAATGTCTGGTTATTCATTACATTTCTCTTAACTTATTGCATTGCAGGTT 
AAAGCATTTGGGCAAATTGTGAAAGGGTTCAGAAGTGGAATTGATCTGTT 
GTTGGCTCTACAGAAGGAATCAAATATGATGAGTTTGCTTTATAAACTCT
GTAAACTTCCTATATTAGTAGGAAAAAGCGGGCTAGAGTTATTTCTTTCT
CAATTCGAAGCAGCCATAGATAGCGACTTTCCAAATTATCAGGTGCCCAT
CTATCTTTCATACTTTACAACAAAATGTCTGTCACTACTCAAAGCAATGC
ATATGGCTTAGATCTCAACTCACACCCCGAGGATCCTAAAGGGATTTGCT
TTTTATTCCTAATGTTTTTGGATGGTTTGATTTATTTCTAACTTGAACTT
ATTAATCTTGTACCAGAACCAAGATGTGACAGATGTAAACGCTGAAACTC
TCACAATACTTATCGAACTTTTTATCGAAAGAGCAACTCAATGGTCTGAG
GTCATTCACACCATAAGCTGCCTAGATGTCCTGAGATCTTTTGCAATCGC
AGCAAGTCTCTCTGCTGGAAGCATGGCCAGGCCTGTTATTTTTCCCGAAT
CAGAAGCTACAGATCAGAAiCAGAAAACAAAAGGGCCAATACTTAAAATC
CAAGGACTATGGCATCCATTTGCAGTTGCAGCCGATGGTCAATTGCCTGT
TCCGAATGATATACTCCTTGGCGAGGCTAGAAGAAGCAGTGGCAGCATTC
ATCCTCGGTCATTGTTACTGACGGGACCAAACATGGGCGGAAAATCAACT
CTTCTTCGTGCAACATGTCTGGCCGTTATCTTTGCCCAAGTTTGTATACT
CGTTAGATAATTACTCTATTCTTTGCAATCAGTTCTTCAACATGAATAAT
AAATTCTGTTTTCTGTCTGCAGCTTGGCTGCTACGTGCCGTGTGAGTCTT
GCGAAATCTCCCTCGTGGATACTATCTTCACAAGGCTTGGCGCATCTGAT
AGAATCATGACAGGAGAGAGTAAGTTTTGTTCTCAAAATACCAATTCCTC
GAACTATTTACTCAGATTTTGTCTGATTGGACAAGGTGGTTTTGCTTTTT
TTTAGGTACCTTTTTGGTAGAATGCACTGAGACAGCGTCAGTTCTTCAGA
ATGCAACTCAGGATTCACTAGTAATCCTTGACGAACTGGGCAGAGGAACT
AGTACTTTCGATGGATACGCCATTGCATACTCGGTAACCTGCTCTTCTCC
TTCAACTTATACTTGTTGATCAACAAAAACATGCAATTCATTTTGCTGAA
ACTTATTGATTTATATCAGGTTTTTCGTCACCTGGTAGAGAAAGTTCAAT
GTCGGATGCTCTTTGCAACACATTACCACCCTCTCACCAAGGAATTCGCG
TCTCACCCACGTGTCACCTCGAAACACATGGCTTGCGCATTCAAATCAAG

Figure 11 (Continued)
ATCTGATTATCAACCACGTGGTTGTGATCAAGACCTAGTGTTCTTGTACC 6650

GTTTAACCGAGGGAGCTTGTCCTGAGAGCTACGGACTTCAAGTGGCACTC 6700

ATGGCTGGAATACCAAACCAAGTGGTTGAAACAGCATCAGGTGCTGCTCA 6750

AGCCATGAAGAGATCAATTGGGGAAAACTTCAAGTCAAGTGAGCTAAGAT 6800

CTGAGTTCTCAAGTCTGCATGAAGACTGGCTCAAGTCATTGGTGGGTATT 6850

TCTCGAGTCGCCCACAACAATGCCCCCATTGGCGAAGATGACTACGACAC 6900

TTTGTTTTGCTTATGGCATGAGATCAAATCCTCTTACTGTGTTCCCAAAT 6950

AAATGGCTATGACATAACACTATCTGAAGCTCGTTAAGTCTTTTGCTTCT 7000

CTGATGTTTATTCCTCTTAAAAAATGCTTATATATCAAAAAATTGTTTCC 7050

TCGATTATAACAAGATTATATATGTATCTGTCGGTTTAGCTATGGTATAT 7100 

AATATATGTATGTTCATGAGATTGGTCAAGAGAAATACTCACAAACAGTA 7150

TATTAAGAAGGAAATATGTTTATGCATTAATTTAAGTTTCAAGATAAACT 7200 

GCAAATAACCTCGACTAAAGTTGCAAAGACCAAACACAAATTACAAAACT 7250

TATAAGACTTAAGTTCTGAATTCCCTAAAACCAAAAAAAAAAACAGAACA 7300 

TATTTTGTTGCATCTACAAACAACACAAACCTACATAGTTTATAACTTAC 7350 

TCATCACTGAGATTAACATCAGAATCATTCTCCATTTCTTCATCTTCACT 7400 

CTCATCATCATCACCACCACCATGATGATTCTCCTCCTCTTCACGTAACC 7450

TAGCAATCTCACTCTGAGCTCTATCAACAATCTGCTTCTTCTGCAACTCC 7500 

AAATCTCTCTGAAAATCAGCTCTCATCTTCTCCAACTCCTTCATTTGCTC 7550 

TTTCTTACTCTTCTCCATCTTCTCATAAACCTTCCCAAACCTCTCAACAG 7600 

AATCCGCCAACATCTTATACGAAGCAGCGTCATTAACCTTCTTCCTCTCG 7650 

TACTCAACCTCATCATCCTCATCCTCCTCCTCTTCAGAATCACCAGGACT 7700

ATCCATCATCTCATCAAACCCATTAGACTTATCTAAATAAACCTTAGTGT 7750 

TCATAAACACAAACTCACCTGAATCAACACCACAAGCTAAACCTAAATCC 7800 

GACTTGGGCGAAACACAAAGCAACATATCCAACTTATTGAAAAACGACCA 7850

TTTACTTGAACCTAAACCTGATTTCTCAACCTTAATCTTCTCTTTTCTAT 7900

ACTTCCTCTTCAAGTCATCAATCATTCTCCTACATTGCGTCTCAGATTTC 7950

TCCATCCTTAGCTCCTCACTCACTTTCTCAGCTACTTCATTCCAATCCTC 8000 

GTTCCTCAAACTCCTTCTACCCAATTGCAAAAACCTATCTCCCCAAACTT 8050 

CAAGCAACACAA 8062 

Figure 11 (Continued)
Figure 12

Comments/References: AtMSH3 3' side antisense: AtMSH3 3' (13 = 2104 bp) from
pUC18/13 Sal1/Sst1/T4 into pCW164 BamH1/T4 in Agrobacterium LBA4404
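
The antisense constructs named in these captions carry a gene fragment in reverse orientation, so the strand that is read is the reverse complement of the sense sequence. The following is a minimal sketch of that relationship, not part of the original disclosure; the function name `reverse_complement` and the ten-base toy fragment are illustrative assumptions.

```python
# Minimal sketch (illustrative only): sense strand -> antisense strand.
# The translation table maps each base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("acgtACGT", "tgcaTGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a plain a/c/g/t sequence."""
    return seq.translate(COMPLEMENT)[::-1]


# Toy fragment chosen for illustration, not a construct from the figures.
sense = "atgggcaagc"
antisense = reverse_complement(sense)
print(antisense)
```

Applying the function twice returns the original fragment, which is a quick sanity check when preparing an antisense insert.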
Figure 13

Comments/References: AtMSH6 (S8) 3' side antisense: 62 Sal1/Sst1/T4 (1379 bp)
into pCW164 BamH1/T4
8210 EcoR1.Sac1.Kpn1

Figure 14

Comments/References: AtMSH6 3'/AtMSH3 3' antisense: AtMSH6 (S8) 3' side (62 = 1379 bp)
Sal1/Sst1/T4 into pPF13 (pCW164 AtMSH3 (S5) 3' side (13 = 2104) antisense)/Sma1, in
LBA4404
8210 EcoR1.Sac1.Kpn1

Figure 15

Comments/References: AtMSH3 3'/AtMSH6 3' antisense (D): AtMSH3 (S5) 3' side
(13 = 2104 bp) Sal1/Sst1/T4 into pPF14 (AtMSH6 (S8) 3' side (62 = 1379 bp) antisense into
pCW164)/Sma1, in LBA4404
8210 EcoR1.Sac1.Kpn1

Figure 16

Comments/References: AtMSH3 (S8) complete, sense orientation: pPF26 (3342 bp)
Sma1 into pCW164 Sma1
Figure 17

Comments/References: pPZP111 with codA EcoR1 cassette in EcoR1 site and Hind3
GUS cassette in Hind3 site. KanR. All genes under promoter/terminator 35S
p3243



Figure 19 
SEQUENCE LISTING

<110> Rhone-Poulenc Agro; Betzner, Andreas Stefan; Doutriaux,
Marie-Pascale; Freyssinet, Georges; Perez, Pascual

<120> Methods for obtaining plant varieties

<130> 395498C 

<150> P09745 

<151> 1997-10-10 

<160> 98 

<210> 1 

<211> 23 

<212> DNA 

<213> Artificial sequence 
<220>

<221> modified_base

<222> 11

<223> I

<220>

<221> modified_base

<222> 14

<223> I

<220>

<221> modified_base

<222> 17

<223> I

<220>

<223> Degenerate oligonucleotides UPMU used to isolate AtMSH3 and
AtMSH6.

<300>

<301> Reenan and Kolodner

<302> Genetics 

<303> 132 

<306> 963-973 

<307> 1992 

<400> 1 

ctggatccac nggnccnaay atg 23 
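
The degenerate primer above mixes fixed bases with ambiguity codes. As a rough illustration (not part of the original listing), the sketch below enumerates the concrete oligonucleotides such a primer stands for. It assumes the standard IUPAC ambiguity codes and assumes that the `<223> I` entries at positions 11, 14 and 17 denote inosine, which is kept as one physical base rather than expanded. The function name `expand` is hypothetical.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes, lowercase as in the listing.
IUPAC = {
    "a": "a", "c": "c", "g": "g", "t": "t",
    "r": "ag", "y": "ct", "s": "cg", "w": "at",
    "k": "gt", "m": "ac", "b": "cgt", "d": "agt",
    "h": "act", "v": "acg", "n": "acgt",
}


def expand(primer, inosine_positions=()):
    """List every concrete sequence encoded by a degenerate primer.

    Positions in inosine_positions (1-based) are written as 'i' and are
    not expanded, on the assumption that they are inosine residues.
    """
    seq = primer.replace(" ", "").lower()
    choices = []
    for pos, base in enumerate(seq, start=1):
        if pos in inosine_positions:
            choices.append("i")
        else:
            choices.append(IUPAC[base])
    return ["".join(combo) for combo in product(*choices)]


upmu = "ctggatccac nggnccnaay atg"  # SEQ ID NO: 1 as printed above
variants = expand(upmu, inosine_positions={11, 14, 17})
for v in variants:
    print(v)
```

Under these assumptions only the `y` at position 20 contributes degeneracy, so two oligonucleotides result. Treating the three `n` positions as fully degenerate instead would give 4 x 4 x 4 x 2 = 128 sequences.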



<210> 2

<211> 23

<212> DNA

<213> Artificial sequence

<220>

<221> modified_base

<222> 15

<223> I

<220>

<221> modified_base

<222> 18

<223> I

<220>

<223> Degenerate oligonucleotides DOMU used to isolate AtMSH3 and
AtMSH6.

<300>

<301> Reenan and Kolodner

<302> Genetics 

<303> 132 

<306> 963-973 

<307> 1992 

<400> 2 

ctggatccrt artgngtnrc raa 23 

<210> 3 

<211> 24 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MSH3 specific primer 636 for PCR using cDNA of Arabidopsis
thaliana ecotype Columbia

<400> 3

tgctagtgcc tcttgcaagc tcat 24



<210> 4 
<211> 27 
<212> DNA 

<213> Artificial sequence 

<220> 

<223> Primer AP1 for PCR using cDNA of Arabidopsis thaliana ecotype
Columbia containing adapter sequences ligated to both its
ends

<400> 4

ccaccctaat acgactcact atagggc 27



<210> 5 
<211> 23 
<212> DNA 

<213> Artificial sequence 

<220> 

<223> Primer AP2 for PCR using cDNA of Arabidopsis thaliana ecotype
Columbia containing adapter sequences ligated to both its
ends

<400> 5

actcactata gggctcgagc ggc 23



<210> 6 

<211> 30 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MSH3 specific primer S525 for PCR using cDNA of Arabidopsis
thaliana ecotype Columbia

<400> 6

aggttctgat tacgtgtgac gcttcactta 30



<210> 7 

<211> 29 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MSH3 specific primer S51 for PCR using cDNA of Arabidopsis
thaliana ecotype Columbia 

<400> 7 

ggatcgggta ctgggttttg agtgtgagg 29 

<210> 8 

<211> 24 

<212> DNA 

<213> Artificial sequence 



<220>

<223> MSH3 specific primer 635 for PCR using cDNA of Arabidopsis
thaliana ecotype Columbia

<400> 8

gcacgcgctt gatggtgttt ccac 24

<210> 9 

<211> 28 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MSH3 specific primer S523 for PCR using cDNA of Arabidopsis
thaliana ecotype Columbia 

<400> 9 

ccagacagta tccagcatgg cagaagta 28 

<210> 10 

<211> 33 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MSH3 specific primer 1S5 for PCR using cDNA of Arabidopsis
thaliana ecotype Columbia

<400> 10

atcccgggat gggcaagcaa aagcagcaga cga 33

<210> 11 

<211> 27 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MSH3 specific primer S53 for PCR using cDNA of Arabidopsis
thaliana ecotype Columbia 

<400> 11 

gacaaagagc gaaatgaggc cccttgg 27 



<210> 12 

<211> 1250 

<212> DNA 

<213> Arabidopsis thaliana ecotype Columbia 

<223> Clone 52 
<400> 12

cccgggatgg gcaagcaaaa gcagcagacg acttctcgtc tcttcgctcc caaacccaaa 60 

tccccgactc acgaaccgaa tccggtagcc gaatcatcaa caccgccacc gaagacatcc 120 

gccactgtat ccttctctcc ttccaagcgt aagctcccct ccgaccacct cgccgccgcg 180 

tcacccaaaa agcctaaact ttctcctcac acccaaaacc cagcacccga tcccaattta 240 

caccaaagat ttctccagag atttctggaa ccctcgccgg aggaatatgt ccccgaaacg 300

tcatcatcga ggaaatacac accattggaa cagcaagtgg tggagctaaa gagcaagtac 360

ccagatgtgg ttttgatggt ggaagctggt tacaggtaca gattcttcgg agaagacgcg 420 

gagatcgcag cacgcgtgtc gggtacctac gctcatatgg atcacaattt catgacggcg 480 

agtgtgccaa cacctcgatt gaatctccac gcgagaagac tggtgaatgc aggatacaag 540 

attggtgtag cgaagcagac cgaaactgca gccactaagt cccatggtgc aaaccggacc 600 

ggcccttctt cccggggacc gtcggcgccg cacaccaaag ccacgcctga agcggctgag 660

gatacaagtg gtggttgtgg tggtgaagaa ggttttggtt cacagagtaa tttcttggtt 720 

tgtgttgtgg atgagagagt taagtcggag acattaggct gtggcattga aacgagtttt 780 

gatgttagag tcggtgttgt tggcgctgaa atctcgacag gtgaagttgt tcacgaagag 840 

cccaatgaca anctcacgag aagtggatca gaggctgcga tcttgagctt gccaccagct 900 

gagctgttgc tcggccagcc tctttcacaa caaactgaga agtttttggt ggcacatgct 960 

ggacctacct caaacgttcg agtggaacgt gcctcaccgg attgtctcag caatggtaat 1020 

gcagtagacg aggttattcc atcatgcgaa aaaatcagcg caggtaactt agaagatgac 1080

aaagaaatga agctggaggc tgctgaaaaa ggaatgtctt gcttgacagt tcacacaatt 1140 

atgaacatgc cacatctgac tgttcaagcc ctcgccctaa cgctttgcca tctcaaacag 1200

tttggatttg aaaggatcct ttaccaaggg gcctcatttc gctctttgtc 1250



<210> 13 

<211> 34 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MSH3 specific primer 2S5 for PCR using cDNA of Arabidopsis
thaliana ecotype Columbia

<400> 13

accccgggcc aaaacgaaca agccggtctt agtc 34 

<210> 14 

<211> 27 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MSH3 specific primer S52 for PCR using cDNA of Arabidopsis 
thaliana ecotype Columbia

<400> 14 

gccacacctg actgttcaag ccctcgc 27 

<210> 15 

<211> 2110 

<212> DNA 

<213> Arabidopsis thaliana ecotype Columbia

<223> Clone 13 

<400> 15 

gccacatctg actgttcaag ccctcgccct aacgttttgc catctcaaac agcttggatt 60 

tgaaaggatc ctttaccaag gggcctcatt tcgctctttg tcaagtaaca cagagatgac 120 

tctcccagcc aatactccgc aacagttgga ggttgtgaaa aacaattcag atggatcgga 180 

atctggctcc ttattccaca atatgaatca cacacttaca gtatatggtt ccaggcttct 240 

tagacactgg gtgactcatc ctctatgcga tagaaatttg atatctgctc ggcttgatgc 300

tgtttctgag atttccgctt gcatgggatc tcatagttct tcccagccca gcagtgagtt 360

ggttgaagaa ggttctgaga gagcaattgt atcacctgag ttttatctcg tgctctcctc 420 

agtcttgaca gctatgtcta gatcatctga tattcaacgt ggaataacaa gaatctttca 480 

tcggactgct aaagccacag agttcattgc agttatggaa gctattttac ttgcggggaa 540 

gcaaattcag cggcttggca taaagcaaga ctctgaaatg aggagtatgc aatctgcaac 600 

tgcgcgatct actcttttga gaaaattgat ttctgttatt tcatcccctg ttgtggttga 660 

caatgccgga aaactcctct ctgccccaaa taaggaagcg gctgttcgag gtgacttgct 720 

cgacatacta accacttcca gcgaccaatt tcctgagctt gctgaagctc gccaagcagt 780 

ttcagtcatc agggaaaagc cggattcctc gatagcttca tctcgcaaga agctcgctat 840 
ccgaaatttg gaacctcttc aagtgccggg gaccacacat ttgatagagc cgcccgttga 900

ccccaaggcc cccacgaatt gggcgaaagt aaatagcacc aagaagacta ttcgatatca 960

ccccccagaa atagtagctg gcttggacga gctagctcta gcaactgaac acctcgccat 1020

tgtgaaccga gcttcgtggg atagttccct caagagcttc agcagatacc acacagatct 1080

taaggctgcc gttcaagctc ttgctgcact ggactgtttg cacccccttt caactctatc 1140

tagaaacaag aactatgtcc gtcccgagtt tgtggatgac tgtgaaccag ttgagataaa 1200

catacagtct ggtcgtcacc ccgtactgga gactatatta caagataact tcgtcccaaa 1260

tgacacaatt ttgcatgcag aaggggaata ttgccaaatt atcaccggac ctaacatggg 1320

aggaaagagc tgctataccc gtcaagttgc tttaatttcc ataatggccc aggttggttc 1380

ctttgtacca gcgtcattcg ccaagccgca cgtgcttgat ggtgttttca ctcggatggg 1440

tgcttcagac agcatccagc atggcagaag taccttccta gaagaattaa gtgaagcgtc 1500

acacataatc agaacctgtt ctcctcgttc gcntgttata tcagatgagc ttggaagagg 1560

cactagcaca cacgacggtg cagccaccgc ccatgcaaca ttacagcatc ccccagcaga 1620

aaagagatgt ccggctcttt ttgtcacgca ttaccccgaa acagccgaga ccagtaacgg 1680

attcccaggt tccgctggga cataccangt ctcgtatccg acattgcaga aggataaagg 1740

cagttatgat catgatgatg cgacccacct atataagctt gtgcgtggtc tttgcagcag 1800

gagctttggt tttaaggttg ctcagcttgc ccagataccc ccatcatgta tacgtcgagc 1860

catctcaacg gctgcaaaat tggaagccga ggtacgtgca agagagagaa atacacgcat 1920

gggagaacca gaaggacatg aagaaccgag aggcgcagaa gaatctattt cggctctagg 1980

tgacttgtct gcagacctga aatttgctct ctctgaagag gacccttgga aagcattcga 2040

gtttttaaag catgcttgga agattgctgg caaaatcaga ctaaaaccaa cttgttcatt 2100

ttgacccggg 2110

<210> 16 

<211> 29 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MSH3 specific primer S51 for PCR using cDNA of Arabidopsis
thaliana ecotype Columbia

<400> 16

ggaccgggca ccgggccntg agtgtgagg 29



<210> 17 

<211> 30 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MSH3 specific primer S525 for PCR using cDNA of Arabidopsis 
thaliana ecotype Columbia 

<400> 17 



aggctctgat tatgtgtgac gctttactta 30 



<210> 18 

<211> 3522 

<212> DNA 

<213> Arabidopsis thaliana ecotype Columbia 
<220> 

<221> CDS 

<222> (100)..(3342)

<223> AtMSH3 full-length cDNA and deduced sequence of the encoded 
polypeptide 

<400> 18 

cctaagaaag cgcgcgaaaa ttggcaaccc aagttcgcca tagccacgac cacgaccttc 60

catttctctt aaacggagga gattacgaat aaagcaatt 99 

atg ggc aag caa aag cag cag acg att tct cgt ttc ttc gct ccc aaa 147
Met Gly Lys Gln Lys Gln Gln Thr Ile Ser Arg Phe Phe Ala Pro Lys
1 5 10 15

ccc aaa tcc ccg act cac gaa ccg aat ccg gta gcc gaa tca tca aca 195
Pro Lys Ser Pro Thr His Glu Pro Asn Pro Val Ala Glu Ser Ser Thr
20 25 30

ccg cca ccg aag ata tcc gcc act gta tcc ttc tct cct tcc aag cgt 243
Pro Pro Pro Lys Ile Ser Ala Thr Val Ser Phe Ser Pro Ser Lys Arg
35 40 45

aag ctt ctc tcc gac cac ctc gcc gcc gcg tca ccc aaa aag cct aaa 291
Lys Leu Leu Ser Asp His Leu Ala Ala Ala Ser Pro Lys Lys Pro Lys
50 55 60

ctt tct cct cac act caa aac cca gta ccc gat ccc aat tta cac caa 339
Leu Ser Pro His Thr Gln Asn Pro Val Pro Asp Pro Asn Leu His Gln
65 70 75 80
aga ctt ccc
Arg Phe Leu 



gaa acg tea 
Glu Thr Ser 



gag eta aag 
Glu Leu Lys 
115 

tac agg tac 
Tyr Arg Tyr 
130 

ttg ggt att 
Leu Gly lie 
145 

cca aca ttt 
Pro Thr Phe 



tac aag ate 
Tyr Lys lie 



cat ggt gca 
His Gly Ala 
195 

tat acc aaa 
Tyr Thr Lys 
210 

ggt ggt gaa 
Gly Gly Glu 
225 

gtg gat gag 
Val Asp Glu 



agt ttt gat 
Ser Phe Asp 



gaa gtt gtt 
Glu Val Val 
275 



cag aga ttt 
Gin Arg Phe 
85 

tea teg agg 
Ser Ser Arg 
100 

age aag tac 
Ser Lys Tyr 



aga ttc tte 
Arg Phe Phe 



tac get cat 
Tyr Ala His 
150 

cga ttg aat 
Arg Leu Asn 
165 

ggc gta gtg 
Gly Val Val 
180 

aac egg ace 
Asn Arg Thr 



gee acg ett 
Ala Thr Leu 



gaa ggt ttt 
Glu Gly Phe 
230 

aga gtt aag 
Arg Val Lys 
245 

gtt aga gtc 
Val Arg Val 
260 

tat gaa gag 
Tyr Glu Glu 



ctg gaa ccc 
Leu Glu Pro 



aaa cac aca 
Lys Tyr Thr 
105 

cca gat gtg 
Pro Asp Val 
120 

gga gaa gac 
Gly Glu Asp 
135 

atg gat cac 
Met Asp His 



ttc cac gtg 
Phe His Val 



aag cag act 
Lys Gin Thr 
185 

ggc cct ttt 
Gly Pro Phe 
200 

gaa gcg get 
Glu Ala Ala 
215 

ggt tea cag 
Gly Ser Gin 



teg gag aca 
Ser Glu Thr 



ggt gtt gtt 
Gly Val Val 
265 

tte aat gat 
Phe Asn Asp 
280 



9 

teg eeg gag 
Ser Pro Glu 
90 

cca ttg gaa 
Pro Leu Glu 



gtt ttg atg 
Val Leu Met 



gcg gag ate 
Ala Glu He 
140 

aat tte atg 
Asn Phe Met 
155 

aga aga ctg 
Arg Arg Leu 
170 

gaa act gca 
Glu Thr Ala 



ttc egg gga 
Phe Arg Gly 



gag gat ata 
Glu Asp lie 
220 

agt aat tte 
Ser Asn Phe 
235 

tta ggc tgt 
Leu Gly Cys 
250 

ggc gtt gaa 
Gly Val Glu 



aat tte atg 
Asn Phe Met 



gaa tat gtt 
Glu Tyr Val 
95 

cag caa gtg 
Gin Gin Val 
110 

gtg gaa gtt 
Val Glu Val 
125 

gea gca cgc 
Ala Ala Arg 



acg gcg agt 
Thr Ala Ser 



gtg aat gca 
Val Asn Ala 
175 

gee att aag 
Ala He Lys 
190 

ctg teg gcg 
Leu Ser Ala 
205 

agt ggt ggt 
Ser Gly Gly 



ttg gtt tgt 
Leu Val Cys 



ggt att gaa 
Gly He Glu 
255 

att teg aca 
He Ser Thr 
2'70 

aga agt gga 
Arg Ser Gly 
285 



ccc 387 
Pro 



gtg 435 
Val 



ggt 483 
Gly 



gtg 531 
Val 



gtg 579 

Val 

160 

gga 627 

Gly 



tec 675 
Ser 



ttg 723 
Leu 



tgt 771 
Cys 



gtt 819 

Val 

240 

atg 867 
Met 



ggt 915 
Gly 



tta 963 
Leu 
gag gcc gtg
Glu Ala Val 
250 

cct ccc tea 
Pro Leu Ser 
305 

acc tea aac 
Thr Ser Asn 



ggt aac gca 
Gly Asn Ala 



ggt aac tta 
Gly Asn Leu 
355 

gga atg tct 

Gly Mez Ser 
370 

act gtt caa 
Thr Val Gin 
385 

ctt gaa agg 
Phe Glu Arg 



aac aca gag 
Asn Thr Glu 



gtg aaa aat 
Val Lys Asn 
435 

atg aat cac 
Met Asn His 
450 

gtg act cat 
Val Thr His 
465 

gcc gtt tct 
Ala Val Ser 



att ctg age 
lie Leu Ser 



caa caa acc 
Gin Gin Thr 
310 

gtt cga gtg 
Val Arg Val 
325 

gta gat gag 
Val Asp Glu 
340 

gaa gat gat 

Glu Asp Asp 



tgc ttg aca 

Cys Leu Thr 



gcc etc gcc 
Ala Leu Ala 
390 

ate etc tac 
lie Leu Tyr 
405 

atg act etc 
Met Thr Leu 
420 

aac tea gat 
Asn Ser Asp 



aca ctt aca 
Thr Leu Thr 



ccc cca cgc 
Pro Leu Cys 
470 

gag ate tct 
Glu lie Ser 
485 



ttg cca cca 
Leu Ser Pro 
295 

gag aag ttc 
Glu Lys Phe 



gaa cgt gcc 
Glu Arg Ala 



gtt att tea 
Val lie Ser 
345 

aaa gaa atg 

Lys Glu Met 
360 

gtt cac aca 
Val His Thr 
375 

eta acg ttt 
Leu Thr Phe 



caa ggg gcc 
Gin Gly Ala 



tea gcc aat 
Ser Ala Asn 
425 

gga teg gaa 
Gly Ser Glu 
440 

gta cat get 
Val Tyr Gly 
455 

gat aga aac 
Asp Arg Asn 



gcc cgc atg 
Ala Cys Met 



10 

gcc gag ccg 
Ala Glu Leu 
300 

ccg gcg gca 
Leu Val Ala 
315 

tea ccg gac 
Ser Leu Asp 
330 

tta tgc gaa 
Leu Cys Glu 



aag ccg gag 

Lys Leu Glu 



act acg aac 
He Mec Asn 
380 

Cgc cac ere 
Cys His Leu 
395 

cca ccc cgc 
Ser Phe Arg 
410 

acc ccg caa 
Thr Leu Gin 



tct ggc ccc 
Ser Gly Ser 



ccc agg ccc 
Ser Arg Leu 
460 

ccg aca ccc 
Leu He Ser 
475 

gga tct cat 
Gly Ser His 
490 



ccg ccc ggc 
Leu Leu Gly 



cac gcc gga 
Mec Ala Gly 



cgc ttc- age 
Cys Phe Ser 
335 

aaa ate age 
Lys He Ser 
350 

get gcc gaa 
Ala Ala Glu 
365 

acg cca cac 
Mec Pro His 



aaa cag ctt 
Lys Gin Phe 



ccc ccg cca 
Ser Leu Ser 
415 

cag ccg gag 
Gin Leu Glu 
430 

cca cce cac 
Leu Phe His 
445 

CCC aga cac 
Leu Arg His 



gcc egg etc 
Ala Arg Leu 



age cet tec 
Ser Ser Ser 
495 



cag 1011 
Gin 



ccc 1059 

Pro 

320 

aac 1107 
Asn 



gca 1155 
Ala 



aaa 1203 
Lys 



ccg 1251 
Leu 



gga 12 9 9 

Gly 

400 

age 1347 
Ser 



gcc 1395 
Val 



aac 1443 
Asn 



egg 1491 
Trp 



gac 153 9 

Asp 

480 

eag 1587 
Gin 
ctc agc agc
Leu Ser Ser 



ccc gag ctt 
Pro Glu Phe 
515 

tea tct gat 
Ser Ser Asp 
530 

aaa gcc aca 
Lys Ala Thr 
545 

aag caa att 
Lys Gin lie 



atg caa tct 
Met Gin Ser 



gtt att tea 
Val He Ser 
595 

gcc eta aat 
Ala Leu Asn 
610 

ate act tec 
He Thr Ser 
625 

gtt eta gtc 
Val Leu Val 



aag aag etc 
Lys Lys Leu 



aca cat teg 
Thr His Leu 
675 

gcg aaa gta 
Val Lys Val 
690 



gag ttg get 
Glu Leu Val 
500 

tat etc gtg 
Tyr Leu Val 



att caa egt 
He Gin Arg 



gag ttc att 
Glu Phe He 
550 

cag egg ett 
Gin Arg Leu 
565 

gea act gtg 
Ala Thr Val 
580 

tec cet gtt 
Ser Pro Val 



aag gaa gcg 
Lys Glu Ala 



age gae caa 
Ser Asp Gin 
630 

ate agg gaa 
He Arg Glu 
645 

get att cga 
Ala He Arg 
660 

ata gag ctg 
He Glu Leu 



aat age ace 
Asn Ser Thr 



gaa gaa ggt 
Glu Glu Gly 
505 

etc tec tea 
Leu Ser Ser 
520 

gga ata aca 
Gly He Thr 
535 

gea gtt atg 
Ala Val Met 



gge ata aag 

Gly He Lys 



cga tct act 
Arg Ser Thr 
585 

gtg gtt gae 
Val Val Asp 
600 

get gtt cga 
Ala Val Arg 
615 

ttt cet gag 
Phe Pro Glu 



aag ctg gat 
Lys Leu Asp 



aat ttg gaa 
Asn Leu Glu 
665 

ccc gtt gat 
Pro Val Asp 
680 

aag aag act 
Lys Lys Thr 
695 



11 

tct gag aga 
Ser Glu Arg 



gtc ttg aca 
Val Leu Thr 



aga ate ttt 
Arg He Phe 
540 

gaa get att 
Glu Ala He 
555 

caa gae tct 

Gin Asp Ser 
570 

ett ttg aga 
Leu Leu Arg 



aat gcc gga 
Asn Ala Gly 



ggt gae ttg 

Gly Asp Leu 
620 

ett get gaa 

Leu Ala Glu 
635 

tec teg ata 

Ser Ser He 
650 

ttt ett caa 

Phe Leu Gin 



tec aag gtc 
Ser Lys Val 



att cga tat 
He Arg Tyr 
700 



gea att gta 
Ala He Val 
510 

get atg tct 
Ala Met Ser 
525 

cat egg act 
His Arg Thr 



tta ett gcg 
Leu Leu Ala 



gaa atg agg 

Glu Met Arg 
575 

aaa ttg att 
Lys Leu He 
590 

aaa ett etc 
Lys Leu Leu 
605 

etc gae ata 
Leu Asp He 



get egc caa 
Ala Arg Gin 



get tea ttt 
Ala Ser Phe 
655 

gtg teg ggg 
Val Ser Gly 
670 

cet atg aat 
Pro His Asn 
685 

eat ccc cca 
His Pro Pro 



tea 1635 
Ser 



aga 1683 
Arg 



get 1731 
Ala 



ggg 1779 

Gly 

560 

agt 1B27 

Ser 



tct 1875 
Ser 



tct 1923 
Ser 



eta 1971 
Leu 



gea 2019 

Ala 

640 

egc 2067 
Arg 



ate 2115 
He 



tgg 216 3 

Trp 



gaa 2211 
Glu 
ata gca gct
He Val Ala 
705 

ate gtg aac 
He Val Asn 



tac tac aca 
Tyr Tyr Thr 



tgt ctg cac 
Cys Leu His 
755 

ccc gag ttt 
Pro Glu Phe 
770 

ggt cgc cat 
Gly Arg His 
785 

aat gac aca 
Asn Asp Thr 



gga cct aac 
Gly Pro Asn 



att tec ata 
He Ser lie 
835 

aag ctg cac 
Lys Leu His 
850 

agt ate cag 
Ser He Gin 
865 

tea cac ata 
Ser His He 



gag ctt gga 
Glu Leu Gly 



gge ttg gat 
Gly Leu Asp 
710 

cga get teg 
Arg Ala Ser 
725 

gat ttt aag 
Asp Phe Lys 
740 

tec ctt tea 
Ser Leu Ser 



gtg gat gac 
Val Asp Asp 



ccc gca ctg 
Pro Val Leu 
790 

att ttg cat 
He Leu His 
805 

atg gga gga 
Met Gly Gly 
820 

atg get cag 
Met Ala Gin 



gtg ctt gat 
Val Leu Asp 



cat gge aga 
His Gly Arg 
870 

ate aga acc 
He Arg Thr 
885 

aga gge act 
Arg Gly Thr 
900 



gag eta get 
Glu Leu Ala 



tgg gat agt 
Trp Asp Ser 



get gee gtt 
Ala Ala Val 
745 

act eta tet 
Thr Leu Ser 
760 

tgt gaa cca 

Cys Glu Pro 
775 

gag acz ata 
Glu Thr He 



gca gaa ggg 
Ala Glu Gly 



aag age tgc 
Lys Ser Cys 
825 

gtt get tec 
Val Gly Ser 
840 

ggt gtt ttc 
Gly Val Phe 
855 

agt ace ttt 
Ser Thr Phe 



tgt tec - tet 
Cys Ser Ser 



age aca cac 
Ser Thr His 
905 
cta gca act
Leu Ala Thr 
715 

tec etc aag 
Phe Leu Lys 
730 

caa get ctt 
Gin Ala Leu 



aga aac aag 
Arg Asn Lys 



gtt gag ata 
Val Glu He 
780 

tea caa gat 
Leu Gin Asp 
795 

gaa tat tgc 
Glu Tyr Cys 
810 

tat ate cgt 
Tyr He Arg 



ttt gta cca 
Phe Val Pro 



act egg atg 
Thr Arg Met 
860 

eta gaa gaa 
Leu Glu Glu 
875 

cgt teg ctt 
Arg Ser Leu 
890 

gac ggt gta 
Asp Gly Val 



gaa cat ctt 
Glu His Leu 



agt ttc agt 
Ser Phe Ser 
735 

get gca ctg 
Ala Ala Leu 
750 

aac tat gtc 
Asn Tyr Val 
765 

aac ata cag 

Asn He Gin 



aac ttc gtc 
Asn Phe Val 



caa att ate 
Gin He He 
815 

caa gtt get 
Gin Val Ala 
830 

gcg tea ttc 
Ala Ser Phe 
845 

ggt get tea 
Gly Ala Ser 



tta agt gaa 
Leu Ser Glu 



gtt ata tta 
Val He Leu 
895 

gee att gcc 
Ala He Ala 
910 



gee 2259 

Ala 

720 

aga 2307 
Arg 



gac 2355 
Asp 



cgt 2403 
Arg 



tet 2451 

Ser 



cca 2499 

Pro 

800 

ace 2547 
Thr 



tta 2595 
Leu 



gee 2643 
Ala 



gac 2691 
Asp 



gcg 2739 

Ala 

880 

gat 2787 
Asp 



tat 2835 
Tyr 
gca aca tta cag cat ctc cta gca gaa aag aga tgc ttg gtt ctt ttt 2883
Ala Thr Leu Gln His Leu Leu Ala Glu Lys Arg Cys Leu Val Leu Phe
915 920 925

gtc acg cat tac cct gaa ata gct gag atc agt aac gga ttc cca ggt 2931
Val Thr His Tyr Pro Glu Ile Ala Glu Ile Ser Asn Gly Phe Pro Gly
930 935 940

tct gtt ggg aca tac cat gtc tcg tat ctg aca ttg cag aag gat aaa 2979
Ser Val Gly Thr Tyr His Val Ser Tyr Leu Thr Leu Gln Lys Asp Lys
945 950 955 960

ggc agt tat gat cat gat gat gtg acc tac cta tat aag ctt gtg cgt 3027
Gly Ser Tyr Asp His Asp Asp Val Thr Tyr Leu Tyr Lys Leu Val Arg
965 970 975

ggt ctt tgc agc agg agc ttt ggt ttt aag gtt gct cag ctt gcc cag 3075
Gly Leu Cys Ser Arg Ser Phe Gly Phe Lys Val Ala Gln Leu Ala Gln
980 985 990

ata cct cca tca tgt ata cgt cga gcc att tca acg gct gca aaa ttg 3123
Ile Pro Pro Ser Cys Ile Arg Arg Ala Ile Ser Met Ala Ala Lys Leu
995 1000 1005

gaa gct gag gta cgt gca aga gag aga aat aca cgc atg gga gaa cca 3171
Glu Ala Glu Val Arg Ala Arg Glu Arg Asn Thr Arg Met Gly Glu Pro
1010 1015 1020

gaa gga cat gaa gaa ccg aga ggc gca gaa gaa tct att tcg gct cta 3219
Glu Gly His Glu Glu Pro Arg Gly Ala Glu Glu Ser Ile Ser Ala Leu
1025 1030 1035 1040

ggt gac ttg ttt gca gac ctg aaa ttt gct ctc cct gaa gag gac cct 3267
Gly Asp Leu Phe Ala Asp Leu Lys Phe Ala Leu Ser Glu Glu Asp Pro
1045 1050 1055

tgg aaa gca ttc gag ttt tta aag cat gct tgg aag att gct ggc aaa 3315
Trp Lys Ala Phe Glu Phe Leu Lys His Ala Trp Lys Ile Ala Gly Lys
1060 1065 1070

atc aga cta aaa cca act tgt tca ttt tgatctaatc ttaacattat 3362
Ile Arg Leu Lys Pro Thr Cys Ser Phe
1075 1080

agcaactgca aggtcttgat catctgttag ttgcgtacta acttatgtgt attagtataa 3422

caagaaaaga gaattagaga gatggattct aatccggtgt tgcagtacat cttttctcca 3482

cccgcataaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3522



<210> 19 
<211> 1081 
<212> PRT 
<213> Arabidopsis thaliana ecotype Columbia

<223> Polypeptide MSH3 

<400> 19 

Met Gly Lys Gln Lys Gln Gln Thr Ile Ser Arg Phe Phe Ala Pro Lys
1 5 10 15

Pro Lys Ser Pro Thr His Glu Pro Asn Pro Val Ala Glu Ser Ser Thr
20 25 30

Pro Pro Pro Lys Ile Ser Ala Thr Val Ser Phe Ser Pro Ser Lys Arg
35 40 45

Lys Leu Leu Ser Asp His Leu Ala Ala Ala Ser Pro Lys Lys Pro Lys
50 55 60

Leu Ser Pro His Thr Gln Asn Pro Val Pro Asp Pro Asn Leu His Gln
65 70 75 80

Arg Phe Leu Gln Arg Phe Leu Glu Pro Ser Pro Glu Glu Tyr Val Pro
85 90 95

Glu Thr Ser Ser Ser Arg Lys Tyr Thr Pro Leu Glu Gln Gln Val Val
100 105 110

Glu Leu Lys Ser Lys Tyr Pro Asp Val Val Leu Met Val Glu Val Gly
115 120 125

Tyr Arg Tyr Arg Phe Phe Gly Glu Asp Ala Glu Ile Ala Ala Arg Val
130 135 140

Leu Gly Ile Tyr Ala His Met Asp His Asn Phe Met Thr Ala Ser Val
145 150 155 160

Pro Thr Phe Arg Leu Asn Phe His Val Arg Arg Leu Val Asn Ala Gly
165 170 175

Tyr Lys Ile Gly Val Val Lys Gln Thr Glu Thr Ala Ala Ile Lys Ser
180 185 190

His Gly Ala Asn Arg Thr Gly Pro Phe Phe Arg Gly Leu Ser Ala Leu
195 200 205

Tyr Thr Lys Ala Thr Leu Glu Ala Ala Glu Asp Ile Ser Gly Gly Cys
210 215 220

Gly Gly Glu Glu Gly Phe Gly Ser Gln Ser Asn Phe Leu Val Cys Val
225 230 235 240

Val Asp Glu Arg Val Lys Ser Glu Thr Leu Gly Cys Gly Ile Glu Met
245 250 255
Ser Phe Asp Val Arg Val Gly Val Val Gly Val Glu Ile Ser Thr Gly
260 265 270

Glu Val Val Tyr Glu Glu Phe Asn Asp Asn Phe Met Arg Ser Gly Leu
275 280 285

Glu Ala Val Ile Leu Ser Leu Ser Pro Ala Glu Leu Leu Leu Gly Gln
290 295 300

Pro Leu Ser Gln Gln Thr Glu Lys Phe Leu Val Ala Met Ala Gly Pro
305 310 315 320

Thr Ser Asn Val Arg Val Glu Arg Ala Ser Leu Asp Cys Phe Ser Asn
325 330 335

Gly Asn Ala Val Asp Glu Val Ile Ser Leu Cys Glu Lys Ile Ser Ala
340 345 350

Gly Asn Leu Glu Asp Asp Lys Glu Met Lys Leu Glu Ala Ala Glu Lys
355 360 365

Gly Met Ser Cys Leu Thr Val His Thr Ile Met Asn Met Pro His Leu
370 375 380

Thr Val Gln Ala Leu Ala Leu Thr Phe Cys His Leu Lys Gln Phe Gly
385 390 395 400

Phe Glu Arg Ile Leu Tyr Gln Gly Ala Ser Phe Arg Ser Leu Ser Ser
405 410 415

Asn Thr Glu Met Thr Leu Ser Ala Asn Thr Leu Gln Gln Leu Glu Val
420 425 430

Val Lys Asn Asn Ser Asp Gly Ser Glu Ser Gly Ser Leu Phe His Asn
435 440 445

Met Asn His Thr Leu Thr Val Tyr Gly Ser Arg Leu Leu Arg His Trp
450 455 460

Val Thr His Pro Leu Cys Asp Arg Asn Leu Ile Ser Ala Arg Leu Asp
465 470 475 480

Ala Val Ser Glu Ile Ser Ala Cys Met Gly Ser His Ser Ser Ser Gln
485 490 495

Leu Ser Ser Glu Leu Val Glu Glu Gly Ser Glu Arg Ala Ile Val Ser
500 505 510

Pro Glu Phe Tyr Leu Val Leu Ser Ser Val Leu Thr Ala Met Ser Arg
515 520 525

Ser Ser Asp Ile Gln Arg Gly Ile Thr Arg Ile Phe His Arg Thr Ala
530 535 540
Lys Ala Thr Glu Phe Ile Ala Val Met Glu Ala Ile Leu Leu Ala Gly
545 550 555 560

Lys Gln Ile Gln Arg Leu Gly Ile Lys Gln Asp Ser Glu Met Arg Ser
565 570 575

Met Gln Ser Ala Thr Val Arg Ser Thr Leu Leu Arg Lys Leu Ile Ser
580 585 590

Val Ile Ser Ser Pro Val Val Val Asp Asn Ala Gly Lys Leu Leu Ser
595 600 605

Ala Leu Asn Lys Glu Ala Ala Val Arg Gly Asp Leu Leu Asp Ile Leu
610 615 620

Ile Thr Ser Ser Asp Gln Phe Pro Glu Leu Ala Glu Ala Arg Gln Ala
625 630 635 640

Val Leu Val Ile Arg Glu Lys Leu Asp Ser Ser Ile Ala Ser Phe Arg
645 650 655

Lys Lys Leu Ala Ile Arg Asn Leu Glu Phe Leu Gln Val Ser Gly Ile
660 665 670

Thr His Leu Ile Glu Leu Pro Val Asp Ser Lys Val Pro His Asn Trp
675 680 685

Val Lys Val Asn Ser Thr Lys Lys Thr Ile Arg Tyr His Pro Pro Glu
690 695 700

Ile Val Ala Gly Leu Asp Glu Leu Ala Leu Ala Thr Glu His Leu Ala
705 710 715 720

Ile Val Asn Arg Ala Ser Trp Asp Ser Phe Leu Lys Ser Phe Ser Arg
725 730 735

Tyr Tyr Thr Asp Phe Lys Ala Ala Val Gln Ala Leu Ala Ala Leu Asp
740 745 750

Cys Leu His Ser Leu Ser Thr Leu Ser Arg Asn Lys Asn Tyr Val Arg
755 760 765

Pro Glu Phe Val Asp Asp Cys Glu Pro Val Glu Ile Asn Ile Gln Ser
770 775 780

Gly Arg His Pro Val Leu Glu Thr Ile Leu Gln Asp Asn Phe Val Pro
785 790 795 800

Asn Asp Thr Ile Leu His Ala Glu Gly Glu Tyr Cys Gln Ile Ile Thr
805 810 815

Gly Pro Asn Met Gly Gly Lys Ser Cys Tyr Ile Arg Gln Val Ala Leu
820 825 830
Ile Ser Ile Met Ala Gln Val Gly Ser Phe Val Pro Ala Ser Phe Ala
835 840 845

Lys Leu His Val Leu Asp Gly Val Phe Thr Arg Met Gly Ala Ser Asp
850 855 860

Ser Ile Gln His Gly Arg Ser Thr Phe Leu Glu Glu Leu Ser Glu Ala
865 870 875 880

Ser His Ile Ile Arg Thr Cys Ser Ser Arg Ser Leu Val Ile Leu Asp
885 890 895

Glu Leu Gly Arg Gly Thr Ser Thr His Asp Gly Val Ala Ile Ala Tyr
900 905 910

Ala Thr Leu Gln His Leu Leu Ala Glu Lys Arg Cys Leu Val Leu Phe
915 920 925

Val Thr His Tyr Pro Glu Ile Ala Glu Ile Ser Asn Gly Phe Pro Gly
930 935 940

Ser Val Gly Thr Tyr His Val Ser Tyr Leu Thr Leu Gln Lys Asp Lys
945 950 955 960

Gly Ser Tyr Asp His Asp Asp Val Thr Tyr Leu Tyr Lys Leu Val Arg
965 970 975

Gly Leu Cys Ser Arg Ser Phe Gly Phe Lys Val Ala Gln Leu Ala Gln
980 985 990

Ile Pro Pro Ser Cys Ile Arg Arg Ala Ile Ser Met Ala Ala Lys Leu
995 1000 1005

Glu Ala Glu Val Arg Ala Arg Glu Arg Asn Thr Arg Met Gly Glu Pro
1010 1015 1020

Glu Gly His Glu Glu Pro Arg Gly Ala Glu Glu Ser Ile Ser Ala Leu
1025 1030 1035 1040

Gly Asp Leu Phe Ala Asp Leu Lys Phe Ala Leu Ser Glu Glu Asp Pro
1045 1050 1055

Trp Lys Ala Phe Glu Phe Leu Lys His Ala Trp Lys Ile Ala Gly Lys
1060 1065 1070

Ile Arg Leu Lys Pro Thr Cys Ser Phe
1075 1080



<210> 20 

<211> 24 

<212> DNA 

<213> Artificial sequence 
<220>

<223> MSH6 specific primer 638 for PCR using cDNA of Arabidopsis
thaliana ecotype Columbia

<400> 20 

tctctaccag gcgacgaaaa accg 24 



<210> 21 
<211> 28 
<212> DNA 

<213> Artificial sequence 

<220>

<223> Primer S81 for PCR using cDNA of Arabidopsis thaliana ecotype
Columbia

<400> 21 

cgtcgccrtt agcaccccct tccctcac 28 



<210> 22 

<211> 30 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MSH6 specific primer S823 for PCR using cDNA of Arabidopsis 
thaliana ecotype Columbia 

<400> 22 



gcttggcgca cctaatagaa tcacgacagg 30 



<210> 23 

<211> 24 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MSH6 specific primer 637 for PCR using cDNA of Arabidopsis 
thaliana ecotype Columbia 

<400> 23 



gacagcgtca gttcttcaga atgc 24 



<210> 24

<211> 33

<212> DNA

<213> Artificial sequence

<220>

<223> MSH6 specific primer 1S8 for PCR using cDNA of Arabidopsis
thaliana ecotype Columbia

<400> 24 

accccgggat gcagcgccag agaccgactt tgt 33 

<210> 25 

<211> 27 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MSH6 specific primer S83 for PCR using cDNA of Arabidopsis
thaliana ecotype Columbia 

<400> 25 

cgcrazctac ggctgcttcg aatcgag 27 

<210> 26 

<211> 2188

<212> DNA

<213> Arabidopsis thaliana ecotype Columbia

<223> Clone 43

<400> 26

cccgggatgc agcgccagag atcgattttg tctttcttcc aaaaacccac ggcggcgact 60 

acgaagggtt tggtttccgg cgatgctgct agcggcgggg gcggcagcgg aggaccacga 120 

tttaacgtga aggaagggga tgctaaaggc gacgctcctg tacgttttgc tgtttcgaaa 180 

tctgtcgatg aggttagagg aacggatact ccaccggaga aggttccgcg tcgtgtcctg 240 

ccgtctggat ttaagccggc tgaatccgcc ggtgatgctt cgtccctgtt ctccaatatt 300

atgcataagt ttgtaaaagt cgatgatcga gattgttctg gagagaggag ccgagaagat 360 

gttgttccgc tgaatgattc atctctatgt atgaaggcta atgatgttat tccccaattt 420 

cgttccaata acggtaaaac tcaagaaaga aaccatgctt ttagtttcag tgggagagct 480

gaacctagat cagtagaaga tataggagta gatggcgatg ttcccggtcc agaaacacca 540

gggatgcgtc cacgtgcttc tcgcttgaag cgagttctgg aggatgaaat gacttttaag 600

gaggacaagg ttcctgtact ggactccaac aaaaggctga aaatgctcca ggacccggtt 660 
cgcggagaga agaaagaagt aaacgaagga accaaacctg aatggcccga gtctcctcga 720

atcagggacg ccaacagaag acgtcccgat gacccccttt acgatagaaa gaccttacac 780

ataccacccg atgtcttcaa gaaaatgccc gcatcacaaa agcaatattg gagtgttaag 840 

agtgaatata tggacattgt gcttttcctt aaagtgggga aattttatga gctgtatgag 900 

ctagatgcgg aattaggtca caaggagctt gactggaaga tgaccatgag tggtgtggga 960 

aaatgcagac aggttggtat ctctgaaagt gggacagatg aggcagtgca aaagctatta 1020 

gctcgtggat ataaagtcgg acgaaccgag cagctagaaa catctgacca agcaaaagcc 1080 

agaggtgcta atactataat tccaaggaag ctagttcagg tattaactcc atcaacagca 1140 

agcgagggaa acatcgggcc tgatgccgtc catcttcttg ccataaaaga gatcaaaacg 1200

gagctacaaa agtgtccaac tgcgtatgga crtgcccttg ttgactgtgc cgccctgagg 1260 

tctcgggcng ggtccaccag cgacgacgca ccatgtgccg cccctggagc gttatcgacg 1320 

caggcccctc caaaggaagc gccanacgac agtaaagggc tatcaagaga agcacaaaag 1380

gctctaagga aatatacgtt gacagggtcc acggcggtac agttggctcc agtaccacaa 1440 

gtaacggggg acacagatgc tgctggagct agaaatataa cagaatctaa cggatacctt 1500 

aaaggctcct ctgaatcatg gaactgcgct gttgatggcc taaatgaatg tgatgttgcc 1560 

cccagcgctc ttggagagct aaccaaccat ctgtctaggc taaagctaga agatgtactt 1620 

aagcacgggg atatttttcc ataccaagcc cacaggggtt gtcccagaat cgatggccag 1680 

acgacggtaa atcttgagat atttaacaac agctgtgatg gtggcccttc agggaccttg 1740 

tacaaatatc ttgataactg tgttagtcca actggtaagc gacccttaag gaattggatc 1800

tgccatccac tcaaagatgt agaaagcatc aataaacggc tcgatgtagt tgaagaattc 1860 

acggcaaact cagaaagtat gcaaatcacc ggccagtatc tccacaaacc tccagactta 1920 

gaaagactgc tcggacgcat caagtccagc gttcgatcat cagcctctgt gttgcctgct 1980 

cttctgggga aaaaagtgct gaaacaacga gttaaagcat ttgggcaaat tgtgaaaggg 2040

ttcagaagtg gaattgatct gttgttggct ctacagaagg aatcaaatat gacgagtttg 2100 

ctttataaac tctgtaaact tcctatatta gtaggaaaaa gcgggctaga gttatttctt 2160 

tctcaattcg aagcagccat agatagcg 2188 
<210> 27
<211> 1385
<212> DNA

<213> Arabidopsis thaliana ecotype Columbia

<223> Clone 62 

<400> 27 

catcagcctc cgtgttgcct gctcttctgg ggaaaaaagt gccgaaacaa cgagttaaag 60 

cacttgggca aattgtgaaa gggttcagaa gtggaactga cctgccgttg gctctacaga 120 

aggaatcaaa tatgatgagt ttgctttata aactccgtaa acttcctata ttagtaggaa 180 

aaagcgggct agagttattt ctttctcaat tcgaagcagc catagatagc gactttccaa 240 

attaccagaa ccaagatgtg acagatgaaa acgctgaaac tctcacaata cttatcgaac 300

cttccatcga aagagcaact caatggtctg aggtcattca caccataagc ngccnagatg 360 

tcccgagatc ttttgcaatc gcagcaagcc tctctgctgg aagcatggcc aggcctgtca 420 

ttccccccga atcagaagct acagaccaga atcagaaaac aaaagggcca acactcaaaa 480 

tccaaggact atggcatcca cctgcagccg cagccgacgg tcaactgcct gccccgaatg 540 

atatactcct tggcgaggct agaagaagca gcggcagcat tcacccccgg ccattgttac 600 

tgacgggacc aaacatgggc ggaaaatcaa ctcttcttcg tgcaacatgt ctggccgtta 660 

tctttgccca acttggctgc tacgcgccgt gtgagtcctg cgaaatctcc ctcgtggata 720 

ctatcctcac aaggcttggc gcatctgata gaatcacgac aggagagagc acccttccgg 780 

tagaatgcac tgagacagcg tcagctcttc agaatgcaac ccaggactca ctagtaatcc 840 

ctgacgaact gggcagagga actagtactt tcgatggata cgccattgca tactcggttt 900 

ttcgccacct ggtagagaaa gttcaatgtc ggatgctcct cgcaacacat taccaccctc 960 

tcaccaagga attcgcgtct cacccacgtg tcacctcgaa acacatggct tgcgcattca 1020

aatcaagatc tgattatcaa ccacgtggtt gtgatcaaga cctagtgttc ttgtaccgtt 1080 

taaccgaggg agcttgtcct gagagctacg gacttcaagt ggcactcatg gctggaatac 1140

caaaccaagt ggttgaaaca gcatcaggtg ctgctcaagc catgaagaga tcaattgggg 1200 

aaaacttcaa gccaagcgag ctaagacctg agtcctcaag tctgcatgaa gactggctca 1260

agtcatcggt gggtacttct cgagtcgccc acaacaatgc ccccattggc gaagatgacc 1320 

acgacacttt gttttgctta tggcatgaga tcaaatcccc ttactgtgtt cccaaataac 1380

ccggg 1385



<210> 28 

<211> 34 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MSH6 specific primer 2S8 for PCR using cDNA of Arabidopsis
thaliana ecotype Columbia

<400> 28



atcccgggtt atttgggaac acagtaagag gatt 34 



<210> 29 

<211> 27 

<212> DNA 

<213> Artificial sequence 
<220>

<223> MSH6 specific primer S82 for PCR using cDNA of Arabidopsis
thaliana ecotype Columbia 

<400> 29 



gcgttcgatc atcagcctct gtgttgc 27 



<210> 30 

<211> 3606 

<212> DNA 

<213> Arabidopsis thaliana ecotype Columbia 
<220> 

<221> CDS 

<222> (142)..(3468)

<223> AtMSH6 full-length cDNA and deduced sequence of the encoded 
polypeptide 

<400> 30 



aaaagttgag ccctgaggag tatcgtttcc gccatttcta cgacgcaagg cgaaaatttt 60 
tggcgccaat ctttcccccc tttcgaattc tctcagctca aaacatcgtt tctctctcac 120 



tctctctcac aattccaaaa a atg cag cgc cag aga tcg atc ttg tct ttc 171
Met Gln Arg Gln Arg Ser Ile Leu Ser Phe
1 5 10
ttc caa aaa ccc acc gcg gcg act acg aag ggt ttg gtt tcc ggc gat 219
Phe Gln Lys Pro Thr Ala Ala Thr Thr Lys Gly Leu Val Ser Gly Asp
15 20 25

gct gct agc ggc ggg ggc ggc agc gga gga cca cga ttt aat gtg aag 267
Ala Ala Ser Gly Gly Gly Gly Ser Gly Gly Pro Arg Phe Asn Val Arg
30 35 40

gaa ggg gat gct aaa ggc gac gct cct gta cgt ttt gct gtt tcg aaa 315
Glu Gly Asp Ala Lys Gly Asp Ala Ser Val Arg Phe Ala Val Ser Lys
45 50 55

tct gtc gat gag gtt aga gga acg gat act cca ccg gag aag gtt ccg 363
Ser Val Asp Glu Val Arg Gly Thr Asp Thr Pro Pro Glu Lys Val Pro
60 65 70

cgt cgt gtc ctg ccg tct gga ttt aag ccg gct gaa tcc gcc ggt gat 411
Arg Arg Val Leu Pro Ser Gly Phe Lys Pro Ala Glu Ser Ala Gly Asp
75 80 85 90

gct tcg tcc ctg ttc tcc aac att atg cat aag ttt gta aaa gtc gat 459
Ala Ser Ser Leu Phe Ser Asn Ile Met His Lys Phe Val Lys Val Asp
95 100 105

gat cga gat tgt tct gga gag agg agc cga gaa gat gtt gtt ccg ctg 507
Asp Arg Asp Cys Ser Gly Glu Arg Ser Arg Glu Asp Val Val Pro Leu
110 115 120

aat gat tca tct cta tgt atg aag gct aat gat gtt att cct caa ttt 555
Asn Asp Ser Ser Leu Cys Met Lys Ala Asn Asp Val Ile Pro Gln Phe
125 130 135

cgt tcc aat aat ggt aaa act caa gaa aga aac cat gct ttt agt ttc 603
Arg Ser Asn Asn Gly Lys Thr Gln Glu Arg Asn His Ala Phe Ser Phe
140 145 150

agt ggg aga gct gaa ctt aga tca gta gaa gat ata gga gta gat ggc 651
Ser Gly Arg Ala Glu Leu Arg Ser Val Glu Asp Ile Gly Val Asp Gly
155 160 165 170

gat gtt cct ggt cca gaa aca cca ggg atg cgt cca cgt gct tct cgc 699
Asp Val Pro Gly Pro Glu Thr Pro Gly Met Arg Pro Arg Ala Ser Arg
175 180 185

ttg aag cga gtt ctg gag gat gaa atg act ttt aag gag gat aag gtt 747
Leu Lys Arg Val Leu Glu Asp Glu Met Thr Phe Lys Glu Asp Lys Val
190 195 200

cct gta ttg gac tct aac aaa agg ctg aaa atg ctc cag gat ccg gtt 795
Pro Val Leu Asp Ser Asn Lys Arg Leu Lys Met Leu Gln Asp Pro Val
205 210 215






cgt gga gag aag aaa gaa gta aac gaa gga acc aaa ttt gaa tgg ctc 843
Cys Gly Glu Lys Lys Glu Val Asn Glu Gly Thr Lys Phe Glu Trp Leu
220 225 230

gag cct cct cga acc agg gac gcc aac aga aga cgt cct gat gac ccc 891
Glu Ser Ser Arg Ile Arg Asp Ala Asn Arg Arg Arg Pro Asp Asp Pro
235 240 245 250

ctt cac gac aga aag acc cta cac ata cca ccc gat gtt ttc aag aaa 939
Leu Tyr Asp Arg Lys Thr Leu His Ile Pro Pro Asp Val Phe Lys Lys
255 260 265

atg tct gca tca caa aag caa tat tgg agc gtt aag agc gaa tat atg 987
Met Ser Ala Ser Gln Lys Gln Tyr Trp Ser Val Lys Ser Glu Tyr Met
270 275 280

gac att gcg ctt ttc ttt aaa gtg ggg aaa ttt tat gag ctg tat gag 1035
Asp Ile Val Leu Phe Phe Lys Val Gly Lys Phe Tyr Glu Leu Tyr Glu
285 290 295

cta gac gcg gaa tta ggc cac aag gag ccc gac tgg aag atg acc atg 1083
Leu Asp Ala Glu Leu Gly His Lys Glu Leu Asp Trp Lys Met Thr Met
300 305 310

agc ggc gcg gga aaa cgc aga cag gcc ggc atc tct gaa agt ggg ata 1131
Ser Gly Val Gly Lys Cys Arg Gln Val Gly Ile Ser Glu Ser Gly Ile
315 320 325 330

gat gag gca gcg caa aag cta tta gcc cgc gga tat aaa gtt gga cga 1179
Asp Glu Ala Val Gln Lys Leu Leu Ala Arg Gly Tyr Lys Val Gly Arg
335 340 345

acc gag cag cta gaa aca tcc gac caa gca aaa gcc aga ggt gct aat 1227
Ile Glu Gln Leu Glu Thr Ser Asp Gln Ala Lys Ala Arg Gly Ala Asn
350 355 360

act ata acc cca agg aag cta gtt cag gta tta act cca tca aca gca 1275
Thr Ile Ile Pro Arg Lys Leu Val Gln Val Leu Thr Pro Ser Thr Ala
365 370 375

agc gag gga aac atc ggg cct gat gcc gtc cat ctc ctt gcc ata aaa 1323
Ser Glu Gly Asn Ile Gly Pro Asp Ala Val His Leu Leu Ala Ile Lys
380 385 390

gag atc aaa atg gag cta caa aag cgc tca act gtg tat gga ttt gct 1371
Glu Ile Lys Met Glu Leu Gln Lys Cys Ser Thr Val Tyr Gly Phe Ala
395 400 405 410

ttt gtt gac tgt gct gcc ttg agg ttt tgg gtt ggg tcc atc agc gat 1419
Phe Val Asp Cys Ala Ala Leu Arg Phe Trp Val Gly Ser Ile Ser Asp
415 420 425
415 420 425 
gac gca tca tgc gct gct ctt gga gcg tta ttg atg cag gtt tct cca 1467
Asp Ala Ser Cys Ala Ala Leu Gly Ala Leu Leu Met Gln Val Ser Pro
430 435 440

aag gaa gtg tta tat gac agt aaa ggg cta tca aga gaa gca caa aag 1515
Lys Glu Val Leu Tyr Asp Ser Lys Gly Leu Ser Arg Glu Ala Gln Lys
445 450 455

gct cta agg aaa tat acg ttg aca ggg tct acg gcg gta cag ttg gct 1563
Ala Leu Arg Lys Tyr Thr Leu Thr Gly Ser Thr Ala Val Gln Leu Ala
460 465 470

cca gta cca caa gta atg ggg gat aca gat gct gct gga gtt aga aat 1611
Pro Val Pro Gln Val Met Gly Asp Thr Asp Ala Ala Gly Val Arg Asn
475 480 485 490

ata ata gaa tct aac gga tac ttt aaa ggt tct tct gaa tca tgg aac 1659
Ile Ile Glu Ser Asn Gly Tyr Phe Lys Gly Ser Ser Glu Ser Trp Asn
495 500 505

tgt gct gtt gat ggt cta aat gaa tgt gat gtt gcc ctt agt gct ctt 1707
Cys Ala Val Asp Gly Leu Asn Glu Cys Asp Val Ala Leu Ser Ala Leu
510 515 520

gga gag cta att aat cat ctg tct agg cta aag cta gaa gac gta ctt 1755
Gly Glu Leu Ile Asn His Leu Ser Arg Leu Lys Leu Glu Asp Val Leu
525 530 535

aag cat ggg gat att ttt cca tac caa gtt tac agg ggt tgt ctc aga 1803
Lys His Gly Asp Ile Phe Pro Tyr Gln Val Tyr Arg Gly Cys Leu Arg
540 545 550

att gat ggc cag acg atg gta aat ctt gag ata ttt aac aat agc tgt 1851
Ile Asp Gly Gln Thr Met Val Asn Leu Glu Ile Phe Asn Asn Ser Cys
555 560 565 570

gat ggt ggt cct tca ggg acc ttg tac aaa tat ctt gat aac tgt gtt 1899
Asp Gly Gly Pro Ser Gly Thr Leu Tyr Lys Tyr Leu Asp Asn Cys Val
575 580 585

agt cca act ggt aag cga ctc tta agg aat tgg atc tgc cat cca ctc 1947
Ser Pro Thr Gly Lys Arg Leu Leu Arg Asn Trp Ile Cys His Pro Leu
590 595 600

aaa gat gta gaa agc atc aat aaa cgg ctt gat gta gtt gaa gaa ttc 1995
Lys Asp Val Glu Ser Ile Asn Lys Arg Leu Asp Val Val Glu Glu Phe
605 610 615

acg gca aac tca gaa agt atg caa atc act ggc cag tat ctc cac aaa 2043
Thr Ala Asn Ser Glu Ser Met Gln Ile Thr Gly Gln Tyr Leu His Lys
620 625 630
ctc cca gac tta gaa aga ctg ctc gga cgc atc aag tct agc gtt cga 2091
Leu Pro Asp Leu Glu Arg Leu Leu Gly Arg Ile Lys Ser Ser Val Arg
635 640 645 650

tca tca gcc tct gtg ttg cct gct ctt ctg ggg aaa aaa gtg ctg aaa 2139
Ser Ser Ala Ser Val Leu Pro Ala Leu Leu Gly Lys Lys Val Leu Lys
655 660 665

caa cga gtc aaa gca ttt ggg caa att gtg aaa ggg ttc aga agt gga 2187
Gln Arg Val Lys Ala Phe Gly Gln Ile Val Lys Gly Phe Arg Ser Gly
670 675 680

att gat ctg ttg ttg gct cta cag aag gaa tca aat atg atg agt ttg 2235
Ile Asp Leu Leu Leu Ala Leu Gln Lys Glu Ser Asn Met Met Ser Leu
685 690 695

ctt tat aaa ctc tgt aaa ctt cct ata tta gta gga aaa agc ggg cta 2283
Leu Tyr Lys Leu Cys Lys Leu Pro Ile Leu Val Gly Lys Ser Gly Leu
700 705 710

gag tta ttt ctt tct caa ttc gaa gca gcc ata gat agc gac ttt cca 2331
Glu Leu Phe Leu Ser Gln Phe Glu Ala Ala Ile Asp Ser Asp Phe Pro
715 720 725 730

aat tat cag aac caa gat gtg aca gat gaa aac gct gaa act ctc aca 2379
Asn Tyr Gln Asn Gln Asp Val Thr Asp Glu Asn Ala Glu Thr Leu Thr
735 740 745

ata ctt atc gaa ctt ttt atc gaa aga gca act caa tgg tct gag gtc 2427
Ile Leu Ile Glu Leu Phe Ile Glu Arg Ala Thr Gln Trp Ser Glu Val
750 755 760

att cac acc ata agc tgc cta gat gtc ctg aga tcc ttt gca atc gca 2475
Ile His Thr Ile Ser Cys Leu Asp Val Leu Arg Ser Phe Ala Ile Ala
765 770 775

gca agt ctc tct gct gga agc atg gcc agg cct gtt att ttt ccc gaa 2523
Ala Ser Leu Ser Ala Gly Ser Met Ala Arg Pro Val Ile Phe Pro Glu
780 785 790

tca gaa gct aca gat cag aat cag aaa aca aaa ggg cca ata ctt aaa 2571
Ser Glu Ala Thr Asp Gln Asn Gln Lys Thr Lys Gly Pro Ile Leu Lys
795 800 805 810

atc caa gga cta tgg cat cca ttt gca gtt gca gcc gat ggt caa ttg 2619
Ile Gln Gly Leu Trp His Pro Phe Ala Val Ala Ala Asp Gly Gln Leu
815 820 825

cct gtt ccg aat gat ata ctc ctt ggc gag gct aga aga agc agt ggc 2667
Pro Val Pro Asn Asp Ile Leu Leu Gly Glu Ala Arg Arg Ser Ser Gly
830 835 840
agc atc cat cct cgg tca ttg tta ctg acg gga cca aac atg ggc gga 2715
Ser Ile His Pro Arg Ser Leu Leu Leu Thr Gly Pro Asn Met Gly Gly
845 850 855

aaa tca act ctt ctt cgt gca aca tgt ctg gcc gtt atc ttt gcc caa 2763
Lys Ser Thr Leu Leu Arg Ala Thr Cys Leu Ala Val Ile Phe Ala Gln
860 865 870

ctt ggc tgc tac gtg ccg tgt gag tct tgc gaa acc tcc ctc gtg gat 2811
Leu Gly Cys Tyr Val Pro Cys Glu Ser Cys Glu Ile Ser Leu Val Asp
875 880 885 890

act atc ttc aca agg ctt ggc gca tct gat aga atc atg aca gga gag 2859
Thr Ile Phe Thr Arg Leu Gly Ala Ser Asp Arg Ile Met Thr Gly Glu
895 900 905

agt acc ttt ttg gta gaa tgc act gag aca gcg tca gtt ctt cag aat 2907
Ser Thr Phe Leu Val Glu Cys Thr Glu Thr Ala Ser Val Leu Gln Asn
910 915 920

gca acc cag gat tca cta gta atc ctc gac gaa ctg ggc aga gga act 2955
Ala Thr Gln Asp Ser Leu Val Ile Leu Asp Glu Leu Gly Arg Gly Thr
925 930 935

agt act ttc gat gga tac gcc acc gca tac tcg gtt ttt cgt cac ctg 3003
Ser Thr Phe Asp Gly Tyr Ala Ile Ala Tyr Ser Val Phe Arg His Leu
940 945 950

gta gag aaa gtt caa tgt cgg atg ctc ttt gca aca cat tac cac cct 3051
Val Glu Lys Val Gln Cys Arg Met Leu Phe Ala Thr His Tyr His Pro
955 960 965 970

ctc acc aag gaa ttc gcg tct cac cca cgc gtc acc tcg aaa cac atg 3099
Leu Thr Lys Glu Phe Ala Ser His Pro Arg Val Thr Ser Lys His Met
975 980 985

gct tgc gca ttc aaa tca aga tct gat tat caa cca cgt ggt tgt gat 3147
Ala Cys Ala Phe Lys Ser Arg Ser Asp Tyr Gln Pro Arg Gly Cys Asp
990 995 1000

caa gac cta gtg ttc ttg tac cgt tta acc gag gga gct tgt cct gag 3195
Gln Asp Leu Val Phe Leu Tyr Arg Leu Thr Glu Gly Ala Cys Pro Glu
1005 1010 1015

agc tac gga ctt caa gtg gca ctc atg gct gga ata cca aac caa gtg 3243
Ser Tyr Gly Leu Gln Val Ala Leu Met Ala Gly Ile Pro Asn Gln Val
1020 1025 1030

gtt gaa aca gca tca ggt gcc gct caa gcc acg aag aga tca act ggg 3291
Val Glu Thr Ala Ser Gly Ala Ala Gln Ala Met Lys Arg Ser Ile Gly
1035 1040 1045 1050

gga aac ttc aag tca agt gag cta aga tct gag ttc tca agt ctg cat 3339
Glu Asn Phe Lys Ser Ser Glu Leu Arg Ser Glu Phe Ser Ser Leu His
1055 1060 1065

gaa gac tgg ctc aag tca ttg gtg ggt att tct cga gtc gcc cac aac 3387
Glu Asp Trp Leu Lys Ser Leu Val Gly Ile Ser Arg Val Ala His Asn
1070 1075 1080

aat gcc ccc att ggc gaa gat gac tac gac act ttg ttt tgc tta tgg 3435
Asn Ala Pro Ile Gly Glu Asp Asp Tyr Asp Thr Leu Phe Cys Leu Trp
1085 1090 1095

cat gag atc aaa tcc tct tac tgt gtt ccc aaa taaatggcta 3478
His Glu Ile Lys Ser Ser Tyr Cys Val Pro Lys
1100 1105

tgacataaca ctatctgaag ctcgttaagt cttttgcctc tctgatgttt attcctctta 3538

aaaaatgctt atatatcaaa aaattgtrre ctcgattaaa aaaaaaaaaa aaaaaaaaaa 3598

aaaaaaaa 3606



<210> 31 

<211> 1109 

<212> PRT 

<213> Arabidopsis thaliana ecotype Columbia 

<223> Polypeptide MSH6 

<400> 31 

Met Gln Arg Gln Arg Ser Ile Leu Ser Phe Phe Gln Lys Pro Thr Ala
1 5 10 15

Ala Thr Thr Lys Gly Leu Val Ser Gly Asp Ala Ala Ser Gly Gly Gly
20 25 30

Gly Ser Gly Gly Pro Arg Phe Asn Val Arg Glu Gly Asp Ala Lys Gly
35 40 45

Asp Ala Ser Val Arg Phe Ala Val Ser Lys Ser Val Asp Glu Val Arg
50 55 60

Gly Thr Asp Thr Pro Pro Glu Lys Val Pro Arg Arg Val Leu Pro Ser
65 70 75 80

Gly Phe Lys Pro Ala Glu Ser Ala Gly Asp Ala Ser Ser Leu Phe Ser
85 90 95

Asn Ile Met His Lys Phe Val Lys Val Asp Asp Arg Asp Cys Ser Gly
100 105 110

Glu Arg Ser Arg Glu Asp Val Val Pro Leu Asn Asp Ser Ser Leu Cys
115 120 125
Met Lys Ala Asn Asp Val Ile Pro Gln Phe Arg Ser Asn Asn Gly Lys
130 135 140

Thr Gln Glu Arg Asn His Ala Phe Ser Phe Ser Gly Arg Ala Glu Leu
145 150 155 160

Arg Ser Val Glu Asp Ile Gly Val Asp Gly Asp Val Pro Gly Pro Glu
165 170 175

Thr Pro Gly Met Arg Pro Arg Ala Ser Arg Leu Lys Arg Val Leu Glu
180 185 190

Asp Glu Met Thr Phe Lys Glu Asp Lys Val Pro Val Leu Asp Ser Asn
195 200 205

Lys Arg Leu Lys Met Leu Gln Asp Pro Val Cys Gly Glu Lys Lys Glu
210 215 220

Val Asn Glu Gly Thr Lys Phe Glu Trp Leu Glu Ser Ser Arg Ile Arg
225 230 235 240

Asp Ala Asn Arg Arg Arg Pro Asp Asp Pro Leu Tyr Asp Arg Lys Thr
245 250 255

Leu His Ile Pro Pro Asp Val Phe Lys Lys Met Ser Ala Ser Gln Lys
260 265 270

Gln Tyr Trp Ser Val Lys Ser Glu Tyr Met Asp Ile Val Leu Phe Phe
275 280 285

Lys Val Gly Lys Phe Tyr Glu Leu Tyr Glu Leu Asp Ala Glu Leu Gly
290 295 300

His Lys Glu Leu Asp Trp Lys Met Thr Met Ser Gly Val Gly Lys Cys
305 310 315 320

Arg Gln Val Gly Ile Ser Glu Ser Gly Ile Asp Glu Ala Val Gln Lys
325 330 335

Leu Leu Ala Arg Gly Tyr Lys Val Gly Arg Ile Glu Gln Leu Glu Thr
340 345 350

Ser Asp Gln Ala Lys Ala Arg Gly Ala Asn Thr Ile Ile Pro Arg Lys
355 360 365

Leu Val Gln Val Leu Thr Pro Ser Thr Ala Ser Glu Gly Asn Ile Gly
370 375 380

Pro Asp Ala Val His Leu Leu Ala Ile Lys Glu Ile Lys Met Glu Leu
385 390 395 400

Gln Lys Cys Ser Thr Val Tyr Gly Phe Ala Phe Val Asp Cys Ala Ala
405 410 415
Leu Arg Phe Trp Val Gly Ser Ile Ser Asp Asp Ala Ser Cys Ala Ala
420 425 430

Leu Gly Ala Leu Leu Met Gln Val Ser Pro Lys Glu Val Leu Tyr Asp
435 440 445

Ser Lys Gly Leu Ser Arg Glu Ala Gln Lys Ala Leu Arg Lys Tyr Thr
450 455 460

Leu Thr Gly Ser Thr Ala Val Gln Leu Ala Pro Val Pro Gln Val Met
465 470 475 480

Gly Asp Thr Asp Ala Ala Gly Val Arg Asn Ile Ile Glu Ser Asn Gly
485 490 495

Tyr Phe Lys Gly Ser Ser Glu Ser Trp Asn Cys Ala Val Asp Gly Leu
500 505 510

Asn Glu Cys Asp Val Ala Leu Ser Ala Leu Gly Glu Leu Ile Asn His
515 520 525

Leu Ser Arg Leu Lys Leu Glu Asp Val Leu Lys His Gly Asp Ile Phe
530 535 540

Pro Tyr Gln Val Tyr Arg Gly Cys Leu Arg Ile Asp Gly Gln Thr Met
545 550 555 560

Val Asn Leu Glu Ile Phe Asn Asn Ser Cys Asp Gly Gly Pro Ser Gly
565 570 575

Thr Leu Tyr Lys Tyr Leu Asp Asn Cys Val Ser Pro Thr Gly Lys Arg
580 585 590

Leu Leu Arg Asn Trp Ile Cys His Pro Leu Lys Asp Val Glu Ser Ile
595 600 605

Asn Lys Arg Leu Asp Val Val Glu Glu Phe Thr Ala Asn Ser Glu Ser
610 615 620

Met Gln Ile Thr Gly Gln Tyr Leu His Lys Leu Pro Asp Leu Glu Arg
625 630 635 640

Leu Leu Gly Arg Ile Lys Ser Ser Val Arg Ser Ser Ala Ser Val Leu
645 650 655

Pro Ala Leu Leu Gly Lys Lys Val Leu Lys Gln Arg Val Lys Ala Phe
660 665 670

Gly Gln Ile Val Lys Gly Phe Arg Ser Gly Ile Asp Leu Leu Leu Ala
675 680 685

Leu Gln Lys Glu Ser Asn Met Met Ser Leu Leu Tyr Lys Leu Cys Lys
690 695 700
Leu Pro Ile Leu Val Gly Lys Ser Gly Leu Glu Leu Phe Leu Ser Gln
705 710 715 720

Phe Glu Ala Ala Ile Asp Ser Asp Phe Pro Asn Tyr Gln Asn Gln Asp
725 730 735

Val Thr Asp Glu Asn Ala Glu Thr Leu Thr Ile Leu Ile Glu Leu Phe
740 745 750

Ile Glu Arg Ala Thr Gln Trp Ser Glu Val Ile His Thr Ile Ser Cys
755 760 765

Leu Asp Val Leu Arg Ser Phe Ala Ile Ala Ala Ser Leu Ser Ala Gly
770 775 780

Ser Met Ala Arg Pro Val Ile Phe Pro Glu Ser Glu Ala Thr Asp Gln
785 790 795 800

Asn Gln Lys Thr Lys Gly Pro Ile Leu Lys Ile Gln Gly Leu Trp His
805 810 815

Pro Phe Ala Val Ala Ala Asp Gly Gln Leu Pro Val Pro Asn Asp Ile
820 825 830

Leu Leu Gly Glu Ala Arg Arg Ser Ser Gly Ser Ile His Pro Arg Ser
835 840 845

Leu Leu Leu Thr Gly Pro Asn Met Gly Gly Lys Ser Thr Leu Leu Arg
850 855 860

Ala Thr Cys Leu Ala Val Ile Phe Ala Gln Leu Gly Cys Tyr Val Pro
865 870 875 880

Cys Glu Ser Cys Glu Ile Ser Leu Val Asp Thr Ile Phe Thr Arg Leu
885 890 895

Gly Ala Ser Asp Arg Ile Met Thr Gly Glu Ser Thr Phe Leu Val Glu
900 905 910

Cys Thr Glu Thr Ala Ser Val Leu Gln Asn Ala Thr Gln Asp Ser Leu
915 920 925

Val Ile Leu Asp Glu Leu Gly Arg Gly Thr Ser Thr Phe Asp Gly Tyr
930 935 940

Ala Ile Ala Tyr Ser Val Phe Arg His Leu Val Glu Lys Val Gln Cys
945 950 955 960

Arg Met Leu Phe Ala Thr His Tyr His Pro Leu Thr Lys Glu Phe Ala
965 970 975

Ser His Pro Arg Val Thr Ser Lys His Met Ala Cys Ala Phe Lys Ser
980 985 990
Arg Ser Asp Tyr Gln Pro Arg Gly Cys Asp Gln Asp Leu Val Phe Leu
995 1000 1005

Tyr Arg Leu Thr Glu Gly Ala Cys Pro Glu Ser Tyr Gly Leu Gln Val
1010 1015 1020

Ala Leu Met Ala Gly Ile Pro Asn Gln Val Val Glu Thr Ala Ser Gly
1025 1030 1035 1040

Ala Ala Gln Ala Met Lys Arg Ser Ile Gly Glu Asn Phe Lys Ser Ser
1045 1050 1055

Glu Leu Arg Ser Glu Phe Ser Ser Leu His Glu Asp Trp Leu Lys Ser
1060 1065 1070

Leu Val Gly Ile Ser Arg Val Ala His Asn Asn Ala Pro Ile Gly Glu
1075 1080 1085

Asp Asp Tyr Asp Thr Leu Phe Cys Leu Trp His Glu Ile Lys Ser Ser
1090 1095 1100

Tyr Cys Val Pro Lys
1105



<210> 32 

<211> 24 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR amplification of ATHGENEA
microsatellite 

<400> 32 



accatgcata gcttaaactt cttg 24



<210> 33 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR amplification of ATHGENEA
microsatellite

<400> 33

acataaccac aaataggggt gc 22
<210> 34
<211> 18
<212> DNA

<213> Artificial sequence

<220>

<223> Forward primer DMCIN-A for PCR on genomic DNA of Arabidopsis
thaliana ssp. Landsberg erecta "Ler"

<400> 34 

gaagcgatat tgttcgtg 18 

<210> 35 

<211> 18 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer DMCIN-B for PCR on genomic DNA of Arabidopsis
thaliana ssp. Landsberg erecta "Ler" 

<400> 35 

agaccgcgac aacactcc 18 

<210> 36 

<211> 31 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer DMCIN-1 for PCR on genomic DNA of Arabidopsis
thaliana ssp. Landsberg erecta "Ler" 

<400> 36 

acgcgtcgac tcagctatga gattactcgt g 31 

<210> 37 

<211> 29 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer DMCIN-2 for PCR on genomic DNA of Arabidopsis
thaliana ssp. Landsberg erecta "Ler" 

<400> 37 

gctctagatt tctcgctcta agactctct 29 
<210> 38

<211> 32

<212> DNA

<213> Artificial sequence
<220>

<223> Forward primer DMCIN-3 for PCR on genomic DNA of Arabidopsis
thaliana ssp. Landsberg erecta "Ler"

<400> 38 

gctctagagc ttctcttaag taagtgattg at 32 

<210> 39 

<211> 48 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer DMCIN-4 for PCR on genomic DNA of Arabidopsis
thaliana ssp. Landsberg erecta "Ler"

<400> 39

tcccccgggc tcgagagatc tccazqqzzt cttcagcrct atgaatcc 48

<210> 40 

<211> 26 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer DMCla for PCR on genomic DNA of Arabidopsis
thaliana ssp. Landsberg erecta "Ler" 

<400> 40 

acgcgtcgac gaattcgcaa gtgggg 26 



<210> 41 

<211> 38 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer DMClb for PCR on genomic DNA of Arabidopsis
thaliana ssp. Landsberg erecta "Ler" 



<400> 41

cccacggaga ccccccgggt accgatttgc ttcgaggg 38

<210> 42 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR amplification of ATEAT1 SSLP marker in
Arabidopsis thaliana subspecies 

<400> 42 

gccactgcgt gaatgatatg 20 

<210> 43 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR amplification of ATEAT1 SSLP marker in
Arabidopsis thaliana subspecies 

<400> 43 

cgaacagcca acatcaattc cc 22 

<210> 44 

<211> 18 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR amplification of NGA63 SSLP marker in
Arabidopsis thaliana subspecies 

<400> 44 

aaccaaggca cagaagcg 18 



<210> 45 

<211> 18 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA63 SSLP marker in
Arabidopsis thaliana subspecies

<400> 45

acccaagtga tcgccacc 18

<210> 46 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR amplification of NGA248 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 46 

taccgaacca aaacacaaag g 21 

<210> 47 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220>

<223> Reverse primer for PCR amplification of NGA248 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 47 

tctgtatctc ggtgaattct cc 22 

<210> 48 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR amplification of NGA128 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 48 

ggtctgttga tgtcgtaagt cg 22



<210> 49 

<211> 22 

<212> DNA 

<213> Artificial sequence 



<220>

<223> Reverse primer for PCR amplification of NGA128 SSLP marker in
Arabidopsis thaliana subspecies

<400> 49

accttgaaac ccttagggag gg 22


<210> 50

<211> 22

<212> DNA

<213> Artificial sequence
<220>

<223> Forward primer for PCR amplification of NGA280 SSLP marker in
Arabidopsis thaliana subspecies

<400> 50

ctgatcccac ggacaatagt gc 22


<210> 51

<211> 20

<212> DNA

<213> Artificial sequence
<220>

<223> Reverse primer for PCR amplification of NGA280 SSLP marker in
Arabidopsis thaliana subspecies

<400> 51

ggctccacaa aaagtgcacc 20


<210> 52

<211> 21

<212> DNA

<213> Artificial sequence
<220>

<223> Forward primer for PCR amplification of NGA111 SSLP marker in
Arabidopsis thaliana subspecies

<400> 52

ctccagttgg aagctaaagg g 21


<210> 53

<211> 21

<212> DNA

<213> Artificial sequence
<220>

<223> Reverse primer for PCR amplification of NGA111 SSLP marker in
Arabidopsis thaliana subspecies

<400> 53

tgttttttag gacaaatggc g 21 

<210> 54 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR amplification of NGA168 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 54 

ccttcacatc caaaacccac 20

<210> 55 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA168 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 55 

gcacataccc acaaccagaa 20 



<210> 56 

<211> 20 

<212> DNA 

<213> Artificial sequence

<220>

<223> Forward primer for PCR amplification of NGA1126 SSLP marker
in Arabidopsis thaliana subspecies 

<400> 56 



cgctacgctt ttcggtaaag 20
<210> 57
<211> 20 
<212> DNA 

<213> Artificial sequence 

<220> 

<223> Reverse primer for PCR amplification of NGA1126 SSLP marker 

in Arabidopsis thaliana subspecies 

<400> 57 

gcacagtcca agtcacaacc 20 

<210> 58 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR amplification of NGA361 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 58 

aaagagatga gaatttggac 20 

<210> 59 

<211> 23 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA361 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 59 

acatatcaat atattaaagt agc 23



<210> 60 

<211> 18 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR amplification of NGA168 SSLP marker in 
Arabidopsis thaliana subspecies 



<400> 60

ccgtctaccg cactgccg 18

<210> 61 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA168 SSLP marker in
Arabidopsis thaliana subspecies

<400> 61

gaggacatgt ataggagcct cg 22

<210> 62 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR amplification of AthBIO2 SSLP marker
in Arabidopsis thaliana subspecies

<400> 62

tgacctcctc ttccatggag 20

<210> 63 

<211> 22 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> Reverse primer for PCR amplification of AthBIO2 SSLP marker

in Arabidopsis thaliana subspecies 

<400> 63 

ttaacagaaa cccaaagctt tc 22 



<210> 64 

<211> 21 

<212> DNA 

<213> Artificial sequence 



<220>

<223> Forward primer for PCR amplification of AthUBIQUE SSLP marker
in Arabidopsis thaliana subspecies



WO 99/19492 PCT/EP98/06977



<400> 64

aggcaaacgt [remainder of sequence illegible in source] 21


<210> 65

<211> 20

<212> DNA

<213> Artificial sequence
<220>

<223> Reverse primer for PCR amplification of AthUBIQUE SSLP marker
in Arabidopsis thaliana subspecies

<400> 65

[sequence illegible in source] 20


<210> 66

<211> 21

<212> DNA

<213> Artificial sequence
<220>

<223> Forward primer for PCR amplification of NGA172 SSLP marker in
Arabidopsis thaliana subspecies

<400> 66

[sequence illegible in source] 21


<210> 


67 






<211> 


19 






<212> 


DNA 






<213> 


Artificial sequence 







<220>

<223> Reverse primer for PCR amplification of NGA172 SSLP marker in
Arabidopsis thaliana subspecies

<400> 67 

catccgaatg ccattgttc 19 



<210> 68 

<211> 21 

<212> DNA 

<213> Artificial sequence 



<220> 
<223> Forward primer for PCR amplification of NGA126 SSLP marker in
Arabidopsis thaliana subspecies

<400> 68

gaaaaaacgc taccttcgtg g 21 

<210> 69

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA126 SSLP marker in
Arabidopsis thaliana subspecies 

<400> 69 

caagagcaat atcaagagca gc 22 

<210> 70 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR amplification of NGA162 SSLP marker in
Arabidopsis thaliana subspecies 

<400> 70 

catgcaattt gcatctgagg 20 

<210> 71 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA162 SSLP marker in
Arabidopsis thaliana subspecies 

<400> 71 

ctctgtcact cttttcctct gg 22 



<210> 72 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220>

<223> Forward primer for PCR amplification of NGA6 SSLP marker in
Arabidopsis thaliana subspecies

<400> 72

tggatttctt cccctcttca c 21



<210> 73

<211> 21

<212> DNA

<213> Artificial sequence
<220>

<223> Reverse primer for PCR amplification of NGA6 SSLP marker in
Arabidopsis thaliana subspecies

<400> 73

atggagaagc ttacactgat c 21



<210> 74

<211> 20

<212> DNA

<213> Artificial sequence
<220>

<223> Forward primer for PCR amplification of NGA12 SSLP marker in
Arabidopsis thaliana subspecies

<400> 74

aatgttgtcc tcccctcctc 20



<210> 75

<211> 22

<212> DNA

<213> Artificial sequence
<220>

<223> Reverse primer for PCR amplification of NGA12 SSLP marker in
Arabidopsis thaliana subspecies

<400> 75

tgatgctctc tgaaacaaga gc 22
<210> 76

<211> 21

<212> DNA

<213> Artificial sequence
<220>

<223> Forward primer for PCR amplification of NGA8 SSLP marker in
Arabidopsis thaliana subspecies

<400> 76 

gagggcaaat ctttatttcg g 21 



<210> 77 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220>

<223> Reverse primer for PCR amplification of NGA8 SSLP marker in
Arabidopsis thaliana subspecies

<400> 77

tggctttcgt ttataaacat cc 22



<210> 78 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR amplification of NGA1107 SSLP marker
in Arabidopsis thaliana subspecies

<400> 78

gcgaaaaaac aaaaaaatcc a 21


<210> 79

<211> 21

<212> DNA

<213> Artificial sequence
<220>

<223> Reverse primer for PCR amplification of NGA1107 SSLP marker
in Arabidopsis thaliana subspecies

<400> 79

cgacgaaccg acagaaccag g 21


<210> 80

<211> 21

<212> DNA

<213> Artificial sequence
<220>

<223> Forward primer for PCR amplification of [marker name illegible] SSLP marker in
Arabidopsis thaliana subspecies

<400> 80

gaaatccaaa tcccagagag g 21


<210> 81

<211> 22

<212> DNA

<213> Artificial sequence
<220>

<223> Reverse primer for PCR amplification of [marker name illegible] SSLP marker in
Arabidopsis thaliana subspecies

<400> 81

tctccccacc agttttgtgt cc 22


<210> 82

<211> 19

<212> DNA

<213> Artificial sequence
<220>

<223> Forward primer for PCR amplification of [marker name illegible] SSLP marker in
Arabidopsis thaliana subspecies

<400> 82

taccgtcaat ttcatcgcc 19


<210> 83

<211> 22

<212> DNA

<213> Artificial sequence
<220>

<223> Reverse primer for PCR amplification of [marker name illegible] SSLP marker in
Arabidopsis thaliana subspecies

<400> 83

ggatccccaa ctgtaaaatc cc 22 

<210> 84 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR amplification of CA72 SSLP marker in
Arabidopsis thaliana subspecies 

<400> 84 

aaccccagta accaaacaca ca 22 

<210> 85 

<211> 20 

<212> DNA 

<213> Artificial sequence
<220>

<223> Reverse primer for PCR amplification of CA72 SSLP marker in
Arabidopsis thaliana subspecies 

<400> 85 

cccagtctaa ccacgaccac 20 

<210> 86 

<211> 20 

<212> DNA

<213> Artificial sequence
<220>

<223> Forward primer for PCR amplification of NGA151 SSLP marker in
Arabidopsis thaliana subspecies 

<400> 86 

gttttgggaa gttttgctgg 20 



<210> 87 

<211> 24 

<212> DNA 

<213> Artificial sequence 



<220>

<223> Reverse primer for PCR amplification of NGA151 SSLP marker in
Arabidopsis thaliana subspecies

<400> 87

cagcctaaaa gcgagagtat gatg 24


<210> 88

<211> 22

<212> DNA

<213> Artificial sequence
<220>

<223> Forward primer for PCR amplification of [marker name illegible] SSLP marker in
Arabidopsis thaliana subspecies

<400> 88

gttatggagt ttctagggca cg 22


<210> 89

<211> 20

<212> DNA

<213> Artificial sequence
<220>

<223> Reverse primer for PCR amplification of [marker name illegible] SSLP marker in
Arabidopsis thaliana subspecies

<400> 89

tgccccacct cgttcttctc 20


<210> 90

<211> 20

<212> DNA

<213> Artificial sequence
<220>

<223> Forward primer for PCR amplification of NGA139 SSLP marker in
Arabidopsis thaliana subspecies

<400> 90

agagccacca gatccgatgg 20


<210> 91

<211> 21

<212> DNA

<213> Artificial sequence
<220>

<223> Reverse primer for PCR amplification of NGA139 SSLP marker in

Arabidopsis thaliana subspecies 

<400> 91 

ggcctcgttt cactatccag g 21 

<210> 92 

<211> 22 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> Forward primer for PCR amplification of NGA76 SSLP marker in 

Arabidopsis thaliana subspecies 

<400> 92 

ggagaaaatg tcactctcca cc 22 

<210> 93 

<211> 20 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> Reverse primer for PCR amplification of NGA76 SSLP marker in 

Arabidopsis thaliana subspecies 

<400> 93 

aggcatggga gacatttacg 20 



<210> 94 

<211> 20 

<212> DNA 

<213> Artificial sequence 



<220> 
<223> 



Forward primer for PCR amplification of ATHS0191 SSLP marker 
in Arabidopsis thaliana subspecies 



<400> 94 



ctccaccaat catgcaaatg 20

<210> 95

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR amplification of ATHS0191 SSLP marker
in Arabidopsis thaliana subspecies 

<400> 95 

tgatgttgat ggagatggtc a 21 



<210> 96 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR amplification of NGA129 SSLP marker in
Arabidopsis thaliana subspecies 

<400> 96 

tcaggaggaa ctaaagtgag gg 22 



<210> 97 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA129 SSLP marker in
Arabidopsis thaliana subspecies 

<400> 97 

cacactgaag atggtcttga gg 22 



<210> 98 

<211> 8062 

<212> DNA 

<213> Arabidopsis thaliana ecotype Columbia 
<220> 

<223> Genomic DNA sequence of AtMSH6 

<400> 98



ttttttggtt gctaacaata aaggtatacg gttttatgtc atcaatataa ctatatataa 60

aagaaatgaa agacatatat tgtcttccca tttatcaaac aaaacaacaa gacctttctt 120

ttactttcta catcggtcaa caaaacacaa gataaacgac accgtctaac cacctcccaa 180 

tcttacccct aagtttaaca cctagaacct tctccatctt cgcaagcaca gcctgattag 240 

gaacagcctt accatcccca tattcctgaa ctacctgagt cctctcattg atctgtttcg 300 

ccaaatccgc ttgtgacacc ttcttctcca atctcgccct ctgcaccacc aacctcacct 360 

ctgctttcac acgatccatc gccgcaggct ctgtctcttc ttccagcttc ttcgtgttaa 420 

tcaccggaac cgccgtagat ttcccctttt tgttcgaacc ggcaccgaat ttcttaaccg 480 

tttgaaccgc gacaccgttt ctcagagctg cgttaaccgc tcccggatcg cgcaggtctc 540 

ggctcttctg ttttgatttg tggagaacta ctggttccca gtcctgtgtt actgctcctg 600 

ggtatctgct cggcatcgtc gatgaactga gagaaaggaa caacgcgaaa atttcactaa 660 

tctgagtntt gaaattgaga aacgacgaag atgaagaatg crgtcgagag gatcgcgata 720 

tttacacata cgaagactgg tccctggaga atccgaccac ctccttctcc atcttcgtct 780 

ctggaacgtt cttagagatg attgacgacg tgtcactatc cgacctgcag ttaaccaatg 840 

cttcctgggt tggattcgtg gtacaccata ttatccgatt tggctcaatg gttctacata 900 

aattcggntt tcggttcggt tatgagttat catcaaaact aagccaacca aaaattctcg 960 

taaaatttat ttcggttcca attcggatcc cttacctcca gaaccgaatt attcgaaacc 1020 

ggggctagcc gaaccgaata ccaatgcctg attgactcgt cggccagaaa gatccaacgg 1080 

tatacaataa tagaacataa atcggacggt catcaaagcc tcaaagagtg aacagtcaac 1140 

aaaaaaagtt gagccctgag gagtaccgtt tccgccattt ctacgacgca aggcgaaaat 1200 

ttttggcgcc aatctttccc ccctttcgaa ttctctcagc tcaaaacatc gtttctctct 1260 

cactctctct cacaattcca aaaaatgcag cgccagagat cgactttgtc tttcttccaa 1320 

aaacccacgg cggcgactac gaagggtttg gtctccggcg atgccgctag cggcgggggc 1380 

ggcagcggag accacgattt aatgtgaagg aaggggacgc taaaggcgac gcttctgtac 1440 

gttttgctgt ttcgaaacct gtcgatgagg ttagaggaac ggatactcca ccggagaagg 1500 

ttccgcgtcg tgtcctgccg tctggattta agccggctga atccgccggt gatgcctcgt 1560 

ccctgttctc caatattatg cataagcttg taaaagtcga tgaccgagat tgttctggag 1620 

agaggtacta atcttcgact ctctcaattt tgtcatcttt agccggaaga agaagactcg 1680

cgtaacttgc cgcactcgtt ggagagattc cgattactgc attggatcgt cgcttacaaa 1740

ctttcaggag ccgagaagat gttgttccgc tgaacgatcc atctctatgt acgaaggcta 1800 

acgatgctat tcctcaatct cgttccaaca atggtaaaac tcaagaaaga aaccatgctt 1860 

tcagtcccag tgggagagct gaactcagat cagtagaaga tataggagta gatggcgatg 1920

ctcctggtcc agaaacacca gggatgcgcc cacgtgcttc tcgcttgaag cgagttctgg 1980 

aggatgaaat gacttttaag gaggataagg ttcctgtatt ggaccccaac aaaaggctga 2040 

aaatgctcca ggatccggtt tgtggagaga agaaagaagt aaacgaagga accaaatttg 2100 

aatggcttga gccttctcga atcagggatg ccaatagaag acgtcctgat gatccccttt 2160 

acgatagaaa gaccttacac ataccacctg atgttttcaa gaaaatgtct gcatcacaaa 2220 

agcaatattg gagtgttaag agtgaacata tggacattgt gcttttcttt aaagtggtta 2280 

gtaactatta acccagtgtt caatccattt cctcaatgtg atttgttcac ttacatctgt 2340 

ctacgttatg cccttcccag gggaaatttt atgagctgta tgagctagat gcggaattag 2400

gtcacaagga gcctgactgg aagatgacca tgagtggtgt gggaaaatgc agacaggtaa 2460

attagttgaa acaactggcc tgcttgaatt attgtgtcta taaattttga caccaccttt 2520

tgttccaggt tggtatctct gaaagtggga tagatgaggc agtgcaaaag ctattagctc 2580

gcgggcaagg gaaccatcac actttatgga attcgtttac tgccacttcg gctaggactc 2640 

aagaaatgga aatcacttca agcatcatta gtcaggatcc tgagaactca ggatgttctc 2700 

ttattcgtta tataataagt cttttcatca aggagtaaca aacaaaactt gcacaatatc 2760 

tgcgtgccca ccggcaaggc atatataccc agctaacctt tgctagttca ctgtagtaac 2820 

agttacggat aatatatgtt tacttgtacg tggtaccctc attttgtctc tcatggaggc 2880 

tttcaagcct tgcgttgaaa ctggacagtt acatatgctc ccaacagaaa ctagcatgca 2940 

gattcatacg ctttcctatt ctactaatta tgtattgaca cactcgttgt ttcttttgaa 3000 

agatataaag ttggacgaat cgagcagcta gaaacatctg accaagcaaa agccagaggt 3060 

gctaatactg taagttttct cggacaggcc aaggagagtg ttgcagactg tttttgatca 3120 

tttctttttc cgtacaccac tttcatgctg taaccaactc aatggccatt ctggtctgac 3180 

tatcagataa ctccaaggaa gctagctcag gtattaactc catcaacagc aagcgaggga 3240 

aacatcgggc ccgatgccgt ccatcctctt gctataaaag aggtttgtta tttacttatt 3300 




taccccacca tgttcagttc atccaagccc tgaaaaatta cactcttctt taccaatctt 3360 

ccatcaagct gtgtaaagga tttggaatta gaaaatcatt atctgatgct ttgttttata 3420 

tgcaagaggc ccccttgaaa agatctgttt aagattcttt gcacttgaaa aattcaatct 3480 

cttcaagtga atcccctact ttcttacaat gatcacagtc tgcaattgca cgtcaagtaa 3540 

tatcattcct tgttactgca tccccctctt tcttaatgac cattgtctat gttgtgtttg 3600 

tctcgtgtgc tggagaaaat gatagctgat ccaagctgta cattatcatg attaagtagc 3660 

tgcccaggaa ttgcctttgg ttacattgcc taatggtttg atgtcaattt ttcttctgaa 3720 

cccttatttt agatcaaaat ggagctacaa aagtgttcaa ctgtgtatgg atttgctttt 3780 

gttgaccgtg ctgccttgag gttttgggct gggtccatca gcgatgatgc atcatgtgct 3840 

gctcttggag cgttattgat gcaggtaagc aagtgtatcc tgcatcttat gtgtaccatg 3900 

tgacttcctg tgcatatact tgggttgcag gaactaattc tgaatcacca tttggtatgt 3960 

tttccccagg cttctccaaa ggaagcgcca tacgacagca aaggcaaacc gcttgtaccg 4020 

ccagctgttt tgttaaacag aatttaaggt aaatgacact ggttaattta aagtgcatac 4080 

atgtcgaaat attgcagggc tatcaagaga agcacaaaag gctctaagga aatatacgct 4140

gacaggcacc atttcagtag gcaagccaac cgacaattta accgctcacc gaatgatagg 4200

tctcccaaac actgctaatg tagatgatgt ttatgtttca atctaatagg gtctacggcg 4260

gtacagccgg ctccagtacc acaagtaacg ggggatacag atgccgctgg agttagaaat 4320

ataacagaat ctaacggata ctttaaaggt tcttctgaat catggaactg cgctgttgat 4380

ggtccaaatg aatgtgatgt tgcccctagt gctcttggag agctaattaa tcatctgtcc 4440 

aggctaaagg tgtgttggct tgtttagttt ttgcttttca caaattaagc aaaggaactt 4500 

ttcataactt acagtttcta tctacttgca gctagaagat gtacttaagc atggggatat 4560 

ttttccacac caagtttaca ggggttgtct cagaattgat ggccagacga tggtaaatct 4620 

tgagatattt aacaatagct gtgatggtgg tccttcaggc aagtgcatat ttcttttttg 4680 

ataacctcaa ctagagggca gacatagaag gaaaaattct aatacttcgt acggatcccc 4740

agtaagtaat agccgatttt tgtttaccta tgtagggacc ttgtacaaat atcttgataa 4800

ctgtgctagt ccaactggta agcgactctt aaggaattgg atctgccatc cactcaaaga 4860

tgtagaaagc atcaacaaac ggcttgatgt agttgaagaa ttcacggcaa actcagaaag 4920
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tacgcaaatc actggccagc 
caccaagcct agcgttcgat: 
gctgaaacaa cgagtaagta 
acaggactaa aacattacgg 
ttattcatta catttctctt 
gaaagggttc agaagtggaa 
gagtttgctc tataaactct 
atttctttct caattcgaag 
ctatctttca tactttacaa 
gatctcaacc cacaccccga 
gatggtttga tttatttcta 
agatgaaaac gctgaaactc 
acggtccgag gtcattcaca 
agcaagcctc tctgctggaa 
agatcagaat cagaaaacaa 
tgcagttgca gccgatggtc 
aagaagcagt ggcagcattc 
aaaatcaact cctcttcgtg 
cgttagataa ttactctatt 
ttctgtctgc agcttggctg 
actatcttca caaggcttgg 
tctcaaaata ccaattcctc 
tttgcttttt tttaggtacc 
atgcaactca ggattcacta 
atggacacgc cattgcatac 
caacaaaaac atgcaattca 
cctggtagag aaagttcaat 
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atct'ccacaa acttccagac ttagaaagac tgctcggacg 4 980 

catcagcctc tgtgtcgcct gctcttctgg ggaaaaaagc 504 0 

tcaatcacaa gttttctgag taatgccttc cacgagcagt 5100 

gtctagctaa agactgttct ccttcttttg caatgtctgg 5160 

aacttattgc attgcaggtt aaagcatttg ggcaaactgt 5220 

ttgatctgtt gttggctcta cagaaggaat caaatatgat 5280 

gtaaacttcc tacattagta ggaaaaagcg ggctagagtt 5340 

cagccataga tagcgacttt ccaaattatc aggtgcccat 54 0 0 

caaaatgtct gtcactactc aaagcaatgc atatggctta 5460 

ggatcctaaa gggatttgct ttttattcct aatgtttttg 5520 

acttgaactt attaaccttg taccagaacc aagatgtgac 5580 

tcacaatact tatcgaactt cccatcgaaa gagcaactca 5640 

ccataagctg cctagatgtc ctgagatctt ttgcaatcgc 5700 

gcatggccag gcctgttact tttcccgaat cagaagctac 5760 

aagggccaat acttaaaatc caaggaccat ggcatccatt 5820 

aattgcctgt tccgaatgat atactccttg gcgaggctag 5880 

atcctcggtc attgttactg acgggaccaa acatgggcgg 5 94 0 

caacatgtct ggccgttatc tttgcccaag tttgtatact 6000 

ctttgcaatc agttcttcaa catgaataat aaattctgtt 6060 

ctacgtgccg tgtgagtctt gcgaaatctc cctcgtggat 612 0 

cgcatctgat agaatcatga caggagagag taagttttgt 618 0 

gaactattta ctcagatttt gtctgattgg acaaggtggt 624 0 

tttttggtag aatgcactga gacagcgtca gttcttcaga 630 0 

gtaatccttg acgaactggg cagaggaact agtactttcg 6360 

tcggcaacct gctcttctcc ttcaacttat acttgttgat 6420 

ttttgctgaa acttattgat ttatatcagg tttttcgtca 648 0 

gtcggatgct ctttgcaaca cactaccacc ccctcaccaa 654 0 
ggaattcgcg tctcacccac gtgtcacctc gaaacacatg gcttgcgcat tcaaatcaag 6600

acccgattac caaccacgtg gccgtgatca agacctagtg ctcccgcacc gttcaaccga 6660 

gggagcctgt cctgagagct acggacttca agtggcactc acggctggaa caccaaacca 6720 

agtggttgaa acagcatcag gtgctgctca agccatgaag agatcaattg gggaaaactt 6780 

caagtcaagt gagctaagat ctgagttctc aagtctgcat gaagactggc tcaagtcatt 6840 

ggtgggtatt tctcgagtcg cccacaacaa tgcccccatt ggcgaagatg actacgacac 6900 

tttgttttgc ttatggcatg agatcaaatc ctcttactgt gttcccaaat aaatggccac 6960

gacataacac tatctgaagc tcgttaagtc ttttgcttct ctgatgttta ctcctcttaa 7020

aaaatgctta tatatcaaaa aattgtttcc tcgattataa caagattata tatgtatctg 7080

tcggtttagc tatggtatat aatatacgta tgctcatgag attggccaag agaaatactc 7140

acaaacagta tattaagaag gaaatatgtt tacgcattaa tttaagtttc aagataaact 7200 

gcaaataacc ccgaccaaag ttgcaaagac caaacacaaa ttacaaaact tataagactt 7260 

aagttctgaa ttccctaaaa ccaaaaaaaa aaacagaaca catcttgctg catctacaaa 7320 

caacacaaac ctacatagtt tataacttac tcatcactga gactaacatc agaatcattc 7380 

tccatttctt catcttcact ctcatcacca tcaccaccac catgatgatt ctcctcctcc 7440 

tcacgcaacc tagcaatccc accctgagct ctatcaacaa tctgcttctt ctgcaactcc 7500

aaatctctct gaaaatcagc cctcaccttc tccaactcct ccacctgctc cttcctactc 7560

ttctccatct tctcataaac cttcccaaac ctctcaacag aatccgccaa catcttatac 7620

gaagcagcgt cattaacctt cttcctctcg tactcaacct catcatcctc atcctcctcc 7680

tcttcagaat caccaggact atccaccatc tcatcaaacc caccagactt atctaaataa 7740

accttagtgt tcataaacac aaactcacct gaatcaacac cacaagctaa acctaaatcc 7800

gacttgggcg aaacacaaag caacatatcc aacttattga aaaacgacca cttacttgaa 7860

cctaaacctg atttctcaac cttaatcttc tcttttccat acttcctctt caagtcatca 7920 

atcattctcc tacattgcgt ctcagatttc tccatcctta gctcctcact cactttctca 7980 

gctacttcat tccaatcctc gttcctcaaa ccccttctac ccaattgcaa aaacctatct 8040 

ccccaaactt caagcaacac aa 8062